XPath ignoring closing tags

Question

I have the following line in my xml file that I am trying to parse using XPath. The XML file itself was got by converting a PDF document using PDFtoHTML converter. As you can see, it has not added a closing tag for
here. So when I try to execute XPath by trying to capture the textValue of tag, it throws me an error saying
should be followed by a closing tag. How do I overcome this in XPath? Hovewer, when I open the file in a browser, everything is rendered fine in the browser.


  
    Dear Mr. AMIT KUMAR,
      
We are happy to enclose
31st March, 2011

Thanks Abhishek S

Paul Butcher · Accepted Answer

What you have posted is not XML. You cannot overcome this with XPath.

After generating the HTML, you could use HTML Tidy, to make it into XML, or you could try using a converter that converts PDF to well-formed XML.

XPath ignoring closing tags

Answers (1)

Related Questions