Python: Which XML parser supports DTD !ENTITY definitions?

Question

I have the below XML file, currently I am using minidom and I get for the example the documentElement's tagName as being xyz:widget that tells me that it ignores the !ENTITY definitions and thus the!DOCTYPE reference.

Which XML parser supports Document Type Definitions so that !ENTITY definitions and !DOCTYPE reference will no be ignored:




]>

  
  bv

So that for the above example, you can get using python the XML equivalent:

bv

or to get a DOM that has as a documentElement as widget and its childNodes as content and name, widget attribute as xmlns with value http://www.w3.org/ns/widgets, etc

I probably may not used the correct terminology, but I hope I made myself clear with the help of the above examples.

Fred Foo · Accepted Answer

LXML handles this just fine:

>>> from lxml import etree
>>> s = """
... 
... 
... ]>
... 
...   
...   bv
... 
... """
>>> etree.fromstring(s)

>>> etree.fromstring(s).xpath("//xyz:content/@src",
...                           namespaces={"xyz": "http://www.w3.org/ns/widgets"})
['pass&.html']

Python: Which XML parser supports DTD !ENTITY definitions?

Answers (1)

Related Questions