How to skip a particular tag and crawl other tag's text in Beautifulsoup

Question

I am crawling a webpage and i am using Beautifulsoup. There is a condition where i want to skip the content of one particular tag and get other tag contents. In the below code i don't want div tag contents. But i couldn't solve this. Please help me.

HTML code,


    
        unwanted text .....
    
    Text..............
    text 
    text
    text
    ,text

I have tried like this,

content = soup.find('blockquote',attrs={'class':'messagetext'}).text

But it is fetching unwanted text inside div tag also.

PepperoniPizza · Accepted Answer

Use the clear function like this:

soup = BeautifulSoup(html_doc)
content = soup.find('blockquote',attrs={'class':'messagetext'})

for tag in content.findChildren():
    if tag.name == 'div':
        tag.clear()

print content.text

This yields:

Text..............
text 
text
text
   ,text

How to skip a particular tag and crawl other tag's text in Beautifulsoup

Answers (1)

Related Questions

How to skip a particular tag and crawl other tag&#39;s text in Beautifulsoup

Answers (1)

Related Questions

How to skip a particular tag and crawl other tag's text in Beautifulsoup