Extracting a specific list item using Beautiful Soup 4

Question

I am trying to extract the "Balance" integer value from this webpage but am having trouble figuring out how to isolate that list item.

This is the code I currently have:

import bs4, requests

res = requests.get('https://live.blockcypher.com/btc/address/3CpfD1gBBdNW7orErj3YyNNSVpzndZ9aP9/')
res.raise_for_status()

soup = bs4.BeautifulSoup(res.text, 'html.parser')
elems = [elem for elem in soup.findAll('li') if 'Balance' in str(elem.text)]

print(elems)

However when I run it all I get is a [] instead of the real balance value.

Any ideas on where I am going wrong?

Keyur Potdar · Accepted Answer

To get the number, you can use this:

balance = soup.find('span', text='Balance').parent.contents[3].strip()
print(balance)

Output:

9.06451275 BTC

Explanation:

soup.find('span', text='Balance') will get you this Balance tag.

Using .parent.contents will give the contents of its parent tag as a list. In that list, the text you want is located in the 3rd index.

>>> for i, content in enumerate(soup.find('span', text='Balance').parent.contents):
...     print(i, content)
...
0

1 Balance
2 

3
            9.06451275 BTC


4 

5

6 
                (-0.0500349 BTC unconfirmed)
              
7

Extracting a specific list item using Beautiful Soup 4

Answers (1)

Related Questions