Beautiful Soup get nested span by class within another span

Question

Within a very large HTML page i want to get a span by class which is unique. The child span of this one, can be queried also by class but which is not unique.

...

   
      I am the child
   
 
...

Output should be "I am the child".

I have tried:

s = soup.select('span[class="uniqueParent"] > span[class="notUniqueChildClassName"]')
s.text

and

s = soup.find('span[class="uniqueParent"] > span[class="notUniqueChildClassName"]')
s.text

But both did not work.

Andrej Kesely · Accepted Answer

You can use CSS selector with dot (e.g .uniqueParent, instead of class="uniqueParent"):

from bs4 import BeautifulSoup


html_doc = """\

   
      I am the child
   
 """


soup = BeautifulSoup(html_doc, "html.parser")

print(soup.select_one(".uniqueParent .notUniqueChildClassName").text)

Prints:


      I am the child

Beautiful Soup get nested span by class within another span

Answers (2)

Related Questions