Find the parent tag of the most occurring tag - BeautifulSoup 4

Question

While working on a scraper with BeautifulSoup, I ran into a problem where I needed to find the parent tag of the most occuring

tag on a page. For Example:

I need to get the the tag which has the most direct children that are

elements. In the above example, it would be

since there are 3 p tags as opposed to .cls2 which only contain 2.

Any suggestions on how I would approach this or if this is entirely possible?

Andrej Kesely · Accepted Answer

You can use max() built-in function with custom key=:

data = '''
   
   

   



   
   

'''

from bs4 import BeautifulSoup

soup = BeautifulSoup(data, 'html5lib')

print(max(soup.select('div:has(> p)'), key=lambda k: len(k.findChildren('p', recursive=False))))

Prints:

Find the parent tag of the most occurring tag - BeautifulSoup 4

Answers (1)

Related Questions