beautiful soup - get tag desired text

Question

Very new to beautiful soup. I'm attempting to get the text between tags.

databs.txt

$343,343
Single
$101,900
Multi
$201,900
Single

Python

#!/usr/bin/python
import os
from bs4 import BeautifulSoup

f = open(os.path.join("databs.txt"), "r")
text = f.read()
soup = BeautifulSoup(text, 'html.parser')


page1 = soup.find('p').getText()
print("P1:",page1)
page2 = soup.find('h3').getText()
print("H3:",page2)

Question:

How do I get the text "$101,900, Multi, $201,900, Single"?

Rustam Garayev · Accepted Answer

If you want to get the tags that have attributes, you can use lambda function to get them as follows:

from bs4 import BeautifulSoup

html = """
$343,343
Single
$101,900
Multi
$201,900
Single
"""
soup = BeautifulSoup(html, 'lxml')


tags_with_attribute = soup.find_all(attrs=lambda x: x is not None)

clean_text = ", ".join([tag.get_text() for tag in tags_with_attribute])

Output would look like:

'$101,900, Multi, $201,900, Single'

beautiful soup - get tag desired text

Answers (2)

Related Questions