BeautifulSoup as XML parser produces an unwanted html/body

Question

When using BeautifulSoup for XML:

import bs4
soup = bs4.BeautifulSoup('', 'lxml')
# add or remove tags in soup
print(soup)

the output has an unnecessary and :

How to avoid these HTML-specific elements and output an XML with BeautifulSoup?

This is not a valid solution:

print(soup.find('mydocument'))

because it removes the , which I want to keep.

Jack Fleeting · Accepted Answer

Try one of these:

my_xml = ''
soup = bs4.BeautifulSoup(my_xml, "xml")

or

soup = bs4.BeautifulSoup(my_xml, "lxml-xml")

in either case print(soup) should output:

Answers (1)