Reputation: 3203
I am converting a PDF document to HTML using the poppler utility pdftohtml. It creates a separate HTML file for each page, but I want a single HTML file for the whole PDF.
I used the following syntax:
pdftohtml -c abc.pdf
But it creates abc-1.html, abc-2.html, ... etc.
I also tried pdftohtml -c abc.pdf abc.html
but did not get the expected output.
Can anyone tell me how to get the HTML output in a single file instead of multiple HTML files?
Upvotes: 4
Views: 4246
Reputation: 423
The question is old, but it is worth supplementing with more information.
The pdftohtml solution works but loses the formatting.
pdf2htmlEX preserves the formatting.
On newer Linux releases this package is not in the repositories and the deb packages do not install, so it can be run through Docker instead:
sudo docker pull bwits/pdf2htmlex
sudo docker run -ti --rm -v /home/user/Documents/pdfToHtml:/pdf bwits/pdf2htmlex pdf2htmlEX --zoom 1.3 file.pdf
Upvotes: 0
Reputation: 3203
I achieved this with the -s option, which generates a single HTML file that includes all pages.
For example:
pdftohtml -c -s -noframes abc.pdf abc.html
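If you need this for more than one file, the command above can be wrapped in a small shell function. This is only a sketch: html_name and pdf_to_single_html are hypothetical helper names, and it assumes pdftohtml from poppler-utils is on the PATH. The exact output file name may differ between poppler versions, so check the directory after the first run.

```shell
#!/bin/sh
# Hypothetical helper: derive the output name by swapping a trailing .pdf for .html.
html_name() {
  printf '%s\n' "${1%.pdf}.html"
}

# Hypothetical wrapper around the command from this answer.
# -c: complex output, -s: single HTML with all pages, -noframes: no frame set.
pdf_to_single_html() {
  p2h_in=$1
  p2h_out=$(html_name "$p2h_in")
  # Assumption: pdftohtml (poppler-utils) is installed; bail out if it is not.
  if ! command -v pdftohtml >/dev/null 2>&1; then
    echo "pdftohtml not found; install poppler-utils" >&2
    return 127
  fi
  pdftohtml -c -s -noframes "$p2h_in" "$p2h_out"
}
```

After sourcing the file, pdf_to_single_html abc.pdf runs the same command as above, and a loop such as for f in *.pdf; do pdf_to_single_html "$f"; done would convert every PDF in the current directory.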
Upvotes: 5