Parsing HTML with BeautifulSoup fails to find a table

Question

I am trying to parse the data on this page: http://www.baseball-reference.com/boxes/CHN/CHN201606020.shtml

I want to extract some of the data in the tables, but I am struggling to find them. For example, this is what I want to do:

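from bs4 import BeautifulSoup
import requests

url = 'http://www.baseball-reference.com/boxes/CHN/CHN201606020.shtml'
# Name the parser explicitly so the result does not depend on what is installed
soup = BeautifulSoup(requests.get(url).text, 'html.parser')
# Look up the batting table by its id attribute
print(soup.find('table', id='ChicagoCubsbatting'))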

The final line returns None even though a table with that id exists in the HTML. Furthermore, len(soup.findAll('table')) returns 1 even though the page contains many tables. I've tried the 'lxml', 'html.parser' and 'html5lib' parsers, and all behave the same way.

What is going on? Why does this not work and what can I do to extract the table?
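
One thing that may be worth checking, given that every parser behaves the same way, is whether the missing tables sit inside HTML comments. Parsers treat a comment as a single opaque text node, so a table inside one never shows up in find or findAll. This is only a hypothesis about the page above, not something confirmed here. The sketch below uses a hypothetical helper, find_table, and a made-up sample string instead of the real page, so it runs without network access.

```python
from bs4 import BeautifulSoup, Comment


def find_table(html, table_id):
    """Hypothetical helper: search the live markup first, then HTML comments."""
    soup = BeautifulSoup(html, 'html.parser')
    table = soup.find('table', id=table_id)
    if table is not None:
        return table
    # A comment is an opaque text node to the parser, so re-parse each one.
    for comment in soup.find_all(string=lambda text: isinstance(text, Comment)):
        inner = BeautifulSoup(str(comment), 'html.parser')
        table = inner.find('table', id=table_id)
        if table is not None:
            return table
    return None


# Made-up sample markup (an assumption, not copied from the real page):
# one visible table and one table hidden inside a comment.
sample = '''
<div><table id="visible"><tr><td>1</td></tr></table></div>
<div><!--
<table id="ChicagoCubsbatting"><tr><td>Fowler</td></tr></table>
--></div>
'''

hidden = find_table(sample, 'ChicagoCubsbatting')
```

On this sample, a plain soup.find for the commented table comes back empty and only one table is visible to findAll, which matches the symptom described above. If the real page is built the same way, passing requests.get(url).text to find_table would be the equivalent call.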
