BeautifulSoup: Print div's based on content of preceding tag

Question

I would like to select the contents of elements based on the preceding tag:

<h4>Models & Products</h4>
<div class="profile-area">...</div>

<h4>Production Capacity (year)</h4>
<div class="profile-area">...</div>

How can I get the "profile-area" values based on the content of the preceding tag?

Here is my code:

import requests
from bs4 import BeautifulSoup
import csv
import re

html_doc = """


  
    
  

  
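    <h4>Models & Products</h4>

    <div class="profile-area">
      Large Buses, Trucks, Trailer-heads
    </div>

    <h4>Production Capacity (year)</h4>

    <div class="profile-area">
      Vehicle 700 units /year
    </div>

    <h4>Output</h4>

    <div class="profile-area">
      Vehicle 356 units ( 2016 )
    </div>

    <div class="profile-area">
      Vehicle 477 units ( 2015 )
    </div>

    <div class="profile-area">
      Vehicle 760 units ( 2014 )
    </div>

    <div class="profile-area">
      Vehicle 647 units ( 2013 )
    </div>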
  


"""
soup = BeautifulSoup(html_doc, 'lxml')  # pass html_doc; the name html is never defined

#link=soup.iframe.get('src')
#print(link.split("%2C"))

for item in soup.select("div.profile-area"):
    print(item.text)

As you can see, I'm also trying to split the Google Maps link into coordinates, but I will probably figure that out on my own.

Thanks for your help!

Druta Ruslan · Accepted Answer

Use .find_previous_sibling() to explicitly find the first preceding h4 tag:

for item in soup.select("div.profile-area"):
    prev_h4 = item.find_previous_sibling('h4').text
    if 'Capacity' in prev_h4:
        print(item.text)

Output

Vehicle 700 units /year
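
The same idea can be extended to collect every "profile-area" value under its heading, which also covers headings such as "Output" that are followed by several divs. The following is a minimal sketch, not part of the original answer: the sample markup, the helper name group_by_heading, and the use of the built-in 'html.parser' (instead of 'lxml') are assumptions made so the example is self-contained.

```python
from collections import defaultdict

from bs4 import BeautifulSoup

# Hypothetical sample markup, assumed to mirror the question's structure:
# each <h4> heading is followed by one or more sibling div.profile-area blocks.
SAMPLE_HTML = """
<div>
  <h4>Models &amp; Products</h4>
  <div class="profile-area">Large Buses, Trucks, Trailer-heads</div>
  <h4>Production Capacity (year)</h4>
  <div class="profile-area">Vehicle 700 units /year</div>
  <h4>Output</h4>
  <div class="profile-area">Vehicle 356 units ( 2016 )</div>
  <div class="profile-area">Vehicle 477 units ( 2015 )</div>
</div>
"""


def group_by_heading(soup):
    """Map each h4 heading text to the texts of the profile-area divs after it."""
    groups = defaultdict(list)
    for item in soup.select("div.profile-area"):
        # Nearest preceding <h4> among the div's siblings, or None if absent.
        heading = item.find_previous_sibling("h4")
        if heading is None:
            # Skip divs without a heading instead of raising AttributeError.
            continue
        groups[heading.get_text(strip=True)].append(item.get_text(strip=True))
    return dict(groups)


soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
groups = group_by_heading(soup)
print(groups["Production Capacity (year)"])
print(groups["Output"])
```

Because find_previous_sibling() returns None when no matching sibling exists, the explicit check keeps the loop from failing on a div that has no h4 before it.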
