lxml XSLT removes CDATA while processing XML

Question

Handling CDATA with lxml involves making parser with suitable declaration, but how about XSLT? For example:

from lxml import etree

parser = etree.XMLParser(strip_cdata=False)
tree = etree.parse('sample_with_cdata.xml', parser)
transform = etree.XSLT(etree.parse('dupe.xsl'))
xml_out = transform(tree)
xml_out.write('processed.xml')

If I process xml file with CDATA through lxml XSLT processor, all CDATA is stripped. How can I tell XSLT processor to leave CDATA as is?

PS. FYI, adding same parser to etree.XSLT doesn't change outcome

Michael Kay · Accepted Answer

As far as XSLT is concerned, CDATA sections in XML are just noise. XSLT treats the same as " which it treats the same as "; they are different ways for the document author to write the same thing.

If you are using CDATA sections in your input to convey information, that is if means something different from xxx, then you need to change your XML design.

lxml XSLT removes CDATA while processing XML

Answers (2)

Related Questions