Reputation: 61331
I'm trying to load a chunk of HTML into MSXML's DOMDocument. The chunk is valid XML with one exception: it contains
&nbsp; entities. MSXML chokes on them and reports "Reference to undefined entity 'nbsp'.".
Can I make MSXML recognize it as valid somehow?
Upvotes: 0
Views: 713
Reputation: 16907
Simple solution: Just run a text replacement of "&nbsp;" with " " before parsing the document. This should work because, outside of CDATA sections, there cannot be a verbatim &nbsp; in the text that should be left unreplaced.
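A minimal sketch of the replacement approach, written in Python rather than against MSXML itself. The standard-library parser stands in for DOMDocument here because it rejects an undeclared &nbsp; in the same way, and the helper name `replace_nbsp` is made up for illustration:

```python
import xml.etree.ElementTree as ET


def replace_nbsp(markup: str, replacement: str = " ") -> str:
    """Swap every literal &nbsp; reference for the replacement text.

    Hypothetical helper. It does not special-case CDATA sections,
    so an &nbsp; inside one would be replaced as well.
    """
    return markup.replace("&nbsp;", replacement)


chunk = "<p>Hello&nbsp;world</p>"

# Parsing the raw chunk fails because nbsp is not a predefined XML entity.
try:
    ET.fromstring(chunk)
    raw_parse_failed = False
except ET.ParseError:
    raw_parse_failed = True

# After the replacement the chunk is plain well-formed XML.
root = ET.fromstring(replace_nbsp(chunk))
```

With MSXML the same idea applies: run the string replacement first, then hand the result to loadXML.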
More standard solution: Declare an nbsp entity in the XML by inserting
<!DOCTYPE foobar [
<!ENTITY nbsp " " >
]>
before the XML root node.
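The DOCTYPE approach can be sketched the same way. This is again Python's standard-library parser standing in for MSXML, and it assumes the chunk has no XML declaration or DOCTYPE of its own (a declaration would have to stay in front of the inserted DOCTYPE). The names `NBSP_DOCTYPE` and `with_nbsp_declared` are hypothetical, and the entity is declared as a real non-breaking space rather than a normal space:

```python
import xml.etree.ElementTree as ET

# Internal DTD subset that declares nbsp as U+00A0 (non-breaking space).
NBSP_DOCTYPE = '<!DOCTYPE foobar [\n<!ENTITY nbsp "&#160;">\n]>\n'


def with_nbsp_declared(markup: str) -> str:
    """Put the DOCTYPE in front of the root element.

    Hypothetical helper; assumes markup starts directly with the
    root element (no <?xml ...?> declaration, no existing DOCTYPE).
    """
    return NBSP_DOCTYPE + markup


chunk = "<foobar>Hello&nbsp;world</foobar>"
root = ET.fromstring(with_nbsp_declared(chunk))

# The parser expands the declared entity while reading the text.
text = root.text
```

The original markup is left untouched with this variant; only the prolog changes, which keeps any &nbsp; references meaningful if the string is serialized again with the same DOCTYPE.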
You can also use "&#xA0;" or "&#160;" as the replacement or entity value if you actually want a non-breaking space instead of a normal space.
Upvotes: 1