BeautifulSoup: get tag text behind another tag

Question

How to find tag by another tag using BeautifulSoup? In this example I want to get for example '0993 999 999' which is in div right behind another div with 'Telefon:' text.

I tried to get it using this:

print parsed.findAll('div',{'class':"dva" })[3].text

But It does not work properly. I think there must be a way to tell BeautifulSoup that it is right behind 'Telefon' text or another way.

 
        Kontaktné údaje
        
             SNP 12, 904 01 Pezinok 
        Telefon:
        013 / 688 27 78
        Mobil:
        0993 999 999
        Fax:
033 / 690 97 94
        E-mail:

EDIT: I tried this, does not works neither.

tags = parsed.findAll('div',{'class':"jedna"})
for tag in tags:
    if tag.text=='Telefon:':
        print tag.next_siebling.string

Could you guys please give me a hint how to do that? Thanks!

alecxe · Accepted Answer

You can use find_next_sibling():

# -*- coding: utf-8 -*-
from bs4 import BeautifulSoup

data = u"""html here"""

soup = BeautifulSoup(data)
print soup.find('div', text='Telefon:').find_next_sibling('div', class_='dva').text
print soup.find('div', text='Mobil:').find_next_sibling('div', class_='dva').text

Prints:

013 / 688 27 78
0993 999 999

FYI, you can extract the duplication and have a nice reusable function:

def get_field_value(soup, field):
    return soup.find('div', text=field+':').find_next_sibling('div', class_='dva').text

soup = BeautifulSoup(data)
print get_field_value(soup, 'Telefon')  # prints 013 / 688 27 78
print get_field_value(soup, 'Mobil')  # prints 0993 999 999

Hope that helps.

BeautifulSoup: get tag text behind another tag

Answers (1)

Related Questions