Get element's text with CDATA

Question

Say, I have an element:

>>> el = etree.XML('')
>>> el.text
'content'

What I'd like to get is . How can I go about it?

Daniel Haley · Accepted Answer

When you do el.text, that's always going to give you the plain text content.

To see the serialized element try tostring() instead:

el = etree.XML('')
print(etree.tostring(el).decode())

this will print:

content

To preserve the CDATA, you need to use an XMLParser() with strip_cdata=False:

parser = etree.XMLParser(strip_cdata=False)

el = etree.XML('', parser=parser)
print(etree.tostring(el).decode())

This will print:

This should be sufficient to fulfill your "I want to make sure in a test that content is wrapped in CDATA" requirement.

Answers (2)