Unable to locate html element in Beautiful Soup

Question

Hello experts , I am working on a very challenging task. This is the HTML I have :

SC/ST: Minimum 18 Years and Maximum 35 Years OBC (Non-Creamy Layer): Minimum 18 Years and Maximum 33 Years Facebook Details Data: Data is always gathered valid Facebook Web Scraping

Districts: Candidates only from the following districts of Assam can apply for these posts:

This is the output i am trying to Achieve (remove the complete element which has facebook.com, the third line of the html should be removed, since it has facebook.com in it )

SC/ST: Minimum 18 Years and Maximum 35 Years OBC (Non-Creamy Layer): Minimum 18 Years and Maximum 33 Years

Districts: Candidates only from the following districts of Assam can apply for these posts:

This is the code I have tried :

getDetails = soup2.find('div', class_='post-body entry-content')
toRemove = "www.facebook.com"
try:
    for headless in (getDetails for getDetails in getDetails.find_all('a') if any( getDetails.find(toRemove))):
        headless.decompose()
except:
    print("facebook not found")

But, this code isnt working, the Output always has facebook.com in it. I have tried everything, but nothing works for me. Its quite a bit of challenge though. Please help me achieve the goal. Thanks

Unable to locate html element in Beautiful Soup

Answers (1)

Related Questions