how to obtain class contents in html using beautifulsoup?

Question

This is my html code on which I wish to work:




    Market Cap:  10.64 Crores
    Current Price:  35.35
    Book Value:  53.52

My question is how to obtain the market cap, current price, book value from "class='col-sm-4'".

Beacuse if I try:

print soup.row.col-sm-4.fa.fa-inr

it does not work. I am kind of new to python and web scraping So please patiently walk trough the process. thanks in advance.

alecxe · Accepted Answer

You can find the labels by text and get the the next_element:

from bs4 import BeautifulSoup

data = """

        Market Cap:  10.64 Crores
        Current Price:  35.35
        Book Value:  53.52
    
"""
soup = BeautifulSoup(data)

titles = ['Market Cap', 'Current Price', 'Book Value']
for title in titles:
    print soup.find(text=lambda x: x.startswith(title)).next_element.text

Prints:

10.64 Crores
35.35
53.52

To get the float value, you can simply split by space and get the first element:

price = soup.find(text=lambda x: x.startswith(title)).strip().split()[0]
print float(price)

You can also get them by a CSS Selector:

for item in soup.select('section#price div.row h4.col-sm-4 b'):
    print item.text

how to obtain class contents in html using beautifulsoup?

Answers (2)

Related Questions