Reputation: 51
Here is the part of the web page:
<div class="MPinfo">
<ul class="frontList">
<li>some text</li>
<li>some text</li>
<li>some text</li>
<li>some text</li>
<li>some text</li>
<li>some text
<a href="/some_local_link/8976">some text</a>;
<a href="/some_local_link/8943">some text</a>;
</li>
<li>E-mail:
<a href="mailto:[email protected]">[email protected]</a>
</li>
</ul>
</div>
I am trying to get the div by its class and then extract the email link just to email itself like: [email protected]
page = urllib.urlopen(link)
soup = BeautifulSoup(page.read())
print soup.find('div', attrs={'class': 'MPinfo'})
I have tried several ways to get the div but it returns empty list or None
Upvotes: 5
Views: 2414
Reputation: 11
You can select all li under the div, it will be a list, so you can select last li element like [-1]
>>> soup.find("div",attrs={"class":"MPinfo"}).find_all("li")[-1].a.text
'[email protected]'
Upvotes: 1