Reputation: 31
I'm trying to get a simple data from html files by using beautiful soup 4. But i have a problem when I encounter the mthml file. The html parser is not working in mhtml file. So, i need to convert mhtml files to simple html file. Or load mhtml file by other things, bcuz the main purpose is just extract one data.
Is anyone can help this by using python? (i already know I can easily convert this by using MS Word...but i want to do this by python programming by automatically)
Upvotes: 3
Views: 7701
Reputation: 121
There's a repo on github, named MHTifier, worth a look. Code is written Python2, it's readable and well commented. Although it's a work under progress but still can be a good starting point.
Upvotes: 1