How to get all the tags (with content) under a certain class with BeautifulSoup?

Question

I have a class in my soup element that is the description of a unit.


 Here is a paragraph
 inner div
 Another div
 
    Item1
    Item2
    Item3

I can easily grab this part with soup.select(".ats-description")[0]. Now I want to remove

, only to keep all the inner tags (to retain text structure). How to do it?

soup.select(".ats-description")[0].getText() gives me all the texts within, like this:

'
Here is a paragraph
inner div
Another div

Item1
Item2
Item3


'

But removes all the inner tags, so it's just unstructured text. I want to keep the tags as well.

uingtea · Accepted Answer

to get innerHTML, use method .decode_contents()

innerHTML = soup.select_one('.ats-description').decode_contents()
print(innerHTML)

Answers (2)