From escaped html -> to regular html? - Python

Question

I used BeautifulSoup to handle XML files that I have collected through a REST API.

The responses contain HTML code, but BeautifulSoup escapes all the HTML tags so it can be displayed nicely.

Unfortunately I need the HTML code.

How would I go about transforming the escaped HTML into proper markup?

Help would be very much appreciated!

Alex Martelli · Accepted Answer

I think you want xml.sax.saxutils.unescape from the Python standard library.

E.g.:

>>> from xml.sax import saxutils as su
>>> s = '&lt;foo&gt;bar&lt;/foo&gt;'
>>> su.unescape(s)
'<foo>bar</foo>'
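
One caveat worth knowing: by default, saxutils.unescape only converts &amp;lt;, &amp;gt; and &amp;amp;. If the escaped text also contains other entities such as &amp;quot;, you can pass them in a dictionary, or (on Python 3.4+) use html.unescape, which handles all named HTML entities. A minimal sketch follows; the sample string and variable names are made up for illustration and are not from the question.

```python
from html import unescape as html_unescape
from xml.sax.saxutils import unescape as sax_unescape

# Hypothetical sample input: escaped markup that also contains &quot;
escaped = '&lt;a href=&quot;/x&quot;&gt;Tom &amp; Jerry&lt;/a&gt;'

# Default behaviour: only &lt;, &gt; and &amp; are converted,
# so &quot; is left untouched.
sax_default = sax_unescape(escaped)

# Extra entities can be supplied as a {entity: replacement} mapping.
sax_extended = sax_unescape(escaped, {'&quot;': '"'})

# html.unescape (Python 3.4+) converts all named and numeric HTML entities.
html_result = html_unescape(escaped)

print(sax_default)
print(sax_extended)
print(html_result)
```

If the responses only ever contain escaped angle brackets and ampersands, the plain saxutils.unescape call above is enough.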
