How to remove xml encoding from beautiful soup?

Question

I would like to know how i can remove the encoding automatically created by prettify in BeautifulSoup. Example:

tree='''
 
  
 
'''
from collections import defaultdict
from bs4 import BeautifulSoup as Soup
root = Soup(tree, 'lxml-xml')
print root.prettify().replace('
', '')

The output looks like

I would like simply:

Dean Fenster · Accepted Answer

There are a few ways you can go about it:

The first, call root.decode_contents(), which will give you a non-prettified content-only output.

Or prettify each chunk in contents separately and then join them. Like this: ' '.join(x.prettify() for x in root.contents).

How to remove xml encoding from beautiful soup?

Answers (1)

Related Questions