itext pdf generation fail on parsing some html tags

Question

I have this html code, which reside in db and I want to parse it in pdf. I am using itext for pdf generation. here is the html in db:

no note.




section






first


second


third

and here is what is parsed and inserted into pdf:

no note.


section

first

second

third

and also here is my code to parse the html into pdf:

org.jsoup.nodes.Document doc = Jsoup.parse(text);
List objects;
objects = HTMLWorker.parseToList(new StringReader(doc.outerHtml()), null);
for (Element object : objects) {
        Element ele = (Element) object;
        document.add(ele);
}

as can be seen numbers and bullet are not shown (which are "ol" and "li" tags in html). How to solve this?

Edit

For more clarification. Here is the text I have in html:

enter image description here

and here is the note inserted into pdf:

enter image description here

pms · Accepted Answer

my friend just solved it:

XMLWorkerHelper.getInstance().parseXHtml(new XHtmlElementHandler(document), new StringReader(text));

simple :)

itext pdf generation fail on parsing some html tags

Answers (2)

Related Questions