Scrape specific NHL score with Python Beautifulsoup

Question

I am trying to scrape only the total score for a specified team. I have written the following:

import urllib.request
import re
from bs4 import BeautifulSoup

#url1 = "http://scores.nbcsports.com/nhl/scoreboard.asp"

## This works, however is using a set day for testing, will need url changed to url1 for current day scoreboard
url = "http://scores.nbcsports.com/nhl/scoreboard.asp?day=20141202"
page = urllib.request.urlopen(url)
soup = BeautifulSoup(page)

allrows = soup.findAll('td')
userows = [t for t in allrows if t.findAll(text=re.compile('Vancouver'))]

print(userows)

This returns:

[Final
1
2
3
Tot


Vancouver
1
2
1
4


Washington
0
2
1
3


, Vancouver]

What I can't seem to get to is the 4 in 4 from the middle block. If it is only possible to get the 1 2 1 4 I could compare the values and always pick the largest, but I can't even seem to get that far. Thanks in advance.

alecxe · Accepted Answer

Find the tag containing Vancouver and get the next td tags by using find_next_siblings():

vancouver = soup.find('a', text='Vancouver')
for td in vancouver.parent.find_next_siblings('td', class_='shsTotD'):
    print(td.text)

Prints:

Scrape specific NHL score with Python Beautifulsoup

Answers (1)

Related Questions