Using beautifulsoup to parse tag with some text

Question

Some html code contains some dt tags like follows:

PLZ:

8047

I want to find the text in the dd tag following the dt tag with the text PLZ:. According to documentation I am trying the following:

number = BeautifulSoup(text).find("dt",text="PLZ:").findNextSiblings("dd")

with text the above string, but all I get is an empty list instead the number I am looking for (as string of course). Maybe I misunderstand the documentation?

Vahid Chakoshy · Accepted Answer

so just try:

from BeautifulSoup import BeautifulSoup

text = """
PLZ:

8047
"""

number = BeautifulSoup(text).find("dt",text="PLZ:").parent.findNextSiblings("dd")
print BeautifulSoup(''.join(number[0]))

or if you find with findNext try:

number = BeautifulSoup(text).find("dt",text="PLZ:").parent.findNext("dd").contents[0]

Using beautifulsoup to parse tag with some text

Answers (2)

Related Questions