problem to extract a text with Beautiful soup using python

Question

I'm trying to extract texts from the forum website, it works good but if there are 2 lines in one comment it extracts the first line in the comment. see examples below

             
   Happy birthday bro! 

    Have a nice day

r = requests.get("https://example.com/threads/73956/page2", headers=headers, cookies=cookies)
soup = BeautifulSoup(r.content, "html.parser")
comments = soup.find_all('div',{'class':'wwCommentBody'})
for div in comments:
    text = (div.find('blockquote',{'class':'postcontent restore'}))
    first_child = next(text.children, None)
    if first_child is not None:
        print(first_child.string.strip())

Ram · Accepted Answer

Just extract the blockquote and print it's text.

for div in comments:
    bq = div.find('blockquote',{'class':'postcontent restore'})
    print(bq.text)

problem to extract a text with Beautiful soup using python

Answers (1)

Related Questions