Using libxml2 to Parse XML Attributes Containing Invalid Characters

Question

I am attempting to parse XML response messages from a third-party interface that contain illegal characters. Please note that these responses are not within my control.

The following is a modified example response

Occasionally, the "value" attribute might contain the ESC control character [0x1b], which is used (questionably) to indicate special characteristics to be applied to the value.

I'm using the libxml2 xmlParseMemory() function to attempt to parse this response. http://www.xmlsoft.org/html/libxml-parser.html#xmlParseMemory

I'm calling the function as as follows:

xmlDocPtr doc = xmlParseMemory( buffer, size );

When the response XML is valid, I get a valid xmlDocPtr and can continue to work with it. If the response contains illegal characters, I receive NULL and wind up at a dead end.

Is there any way I parse these messages without receiving errors and without throwing away the illegal characters?

Using libxml2 to Parse XML Attributes Containing Invalid Characters

Answers (1)

Related Questions