Jack L

Reputation: 5

Beautiful Soup BS4 "data-foo" associated text between tags not displaying

From this Tag:

<div class="matchDate renderMatchDateContainer" data-kickoff="1313244000000">Sat 13 Aug 2011</div>

I want to extract the "Sat 13 Aug 2011" using bs4 Beautiful Soup.

My current Code:

import requests
from bs4 import BeautifulSoup
url = 'https://www.premierleague.com/match/7468'
j = requests.get(url)
soup = BeautifulSoup(j.content, "lxml")

containedDateTag_string = soup.find_all('div', class_="matchDate renderMatchDateContainer")
print (containedDateTag_string)

When I run it, the printed output does not contain "Sat 13 Aug 2011" and is simply stored and printed as:

[<div class="matchDate renderMatchDateContainer" data-kickoff="1313244000000"></div>]

Is there a way that I can get this string to be displayed? I have also tried parsing further through the tag with ".next_sibling" and ".text", but both display "[]" rather than the desired string, which is why I reverted to searching for just the 'div' to see if I could at least get the text to display.
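Incidentally, the "data-kickoff" attribute that does appear in the static HTML is a Unix timestamp in milliseconds, so the same date can be recovered from it with just the standard library. A small sketch, assuming the timestamp is a UTC-based epoch value:

```python
from datetime import datetime, timezone

# "data-kickoff" holds milliseconds since the Unix epoch
kickoff_ms = 1313244000000
kickoff = datetime.fromtimestamp(kickoff_ms / 1000, tz=timezone.utc)

print(kickoff.strftime('%a %d %b %Y'))  # Sat 13 Aug 2011
```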

Upvotes: 0

Views: 584

Answers (2)

jlaur

Reputation: 740

Without Selenium - but using requests and the site's own API - it would look something like this (you could grab plenty of other data about each game, but here is just the code for the date part):

import requests
from time import sleep

def scraper(match_id):
    headers = {
        "Origin": "https://www.premierleague.com",
        "Referer": "https://www.premierleague.com/match/%d" % match_id,
    }

    api_endpoint = "https://footballapi.pulselive.com/football/broadcasting-schedule/fixtures/%d" % match_id
    r = requests.get(api_endpoint, headers=headers)
    if r.status_code != 200:
        return None
    else:
        data = r.json()
        # this will return something like this:
        # {'broadcasters': [],
        #  'fixture': {'attendance': 25700,
        #              'clock': {'label': "90 +4'00", 'secs': 5640},
        #              'gameweek': {'gameweek': 1, 'id': 744},
        #              'ground': {'city': 'London', 'id': 16, 'name': 'Craven Cottage'},
        #              'id': 7468,
        #              'kickoff': {'completeness': 3,
        #                          'gmtOffset': 1.0,
        #                          'label': 'Sat 13 Aug 2011, 15:00 BST',
        #                          'millis': 1313244000000},
        #              'neutralGround': False,
        #              'outcome': 'D',
        #              'phase': 'F',
        #              'replay': False,
        #              'status': 'C',
        #              'teams': [{'score': 0,
        #                         'team': {'club': {'abbr': 'FUL',
        #                                           'id': 34,
        #                                           'name': 'Fulham'},
        #                                  'id': 34,
        #                                  'name': 'Fulham',
        #                                  'shortName': 'Fulham',
        #                                  'teamType': 'FIRST'}},
        #                        {'score': 0,
        #                         'team': {'club': {'abbr': 'AVL',
        #                                           'id': 2,
        #                                           'name': 'Aston Villa'},
        #                                  'id': 2,
        #                                  'name': 'Aston Villa',
        #                                  'shortName': 'Aston Villa',
        #                                  'teamType': 'FIRST'}}]}}

        return data

match_id = 7468
json_blob = scraper(match_id)
if json_blob is not None:
    date = json_blob['fixture']['kickoff']['label']
    print(date)

You need the headers with those two fields to get the data. So if you had a bunch of match_ids you could just loop through them with this function:

for match_id in range(7000, 8000):
    json_blob = scraper(match_id)
    if json_blob is not None:
        date = json_blob['fixture']['kickoff']['label']
        print(date)
        sleep(1)
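If you only want the date as it is shown on the match page, note that the API's kickoff label also includes the time and timezone; splitting off everything after the comma is enough. A minimal sketch with the label hard-coded for illustration:

```python
# Example label as returned in json_blob['fixture']['kickoff']['label']
label = 'Sat 13 Aug 2011, 15:00 BST'

# Everything before the comma is the date part shown on the page
match_date = label.split(',')[0]

print(match_date)  # Sat 13 Aug 2011
```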

Upvotes: 0

Vinícius Figueiredo

Reputation: 6518

Scraping the content via .page_source with selenium/ChromeDriver is the way to go here, since the date text is generated by JavaScript:

from selenium import webdriver
from bs4 import BeautifulSoup

url = "https://www.premierleague.com/match/7468"
driver = webdriver.Chrome()
driver.get(url)

soup = BeautifulSoup(driver.page_source, 'lxml')

Then you can do your .find the way you were doing:

>>> soup.find('div', {'class':"matchDate renderMatchDateContainer"}).text

'Sat 13 Aug 2011'
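If you prefer CSS selectors on the BeautifulSoup side as well, .select_one accepts the same selector string as Selenium's CSS lookup. A minimal sketch, using the div from the question as a stand-in for the rendered page source:

```python
from bs4 import BeautifulSoup

# Stand-in for driver.page_source after the JavaScript has rendered the date
html = ('<div class="matchDate renderMatchDateContainer" '
        'data-kickoff="1313244000000">Sat 13 Aug 2011</div>')
soup = BeautifulSoup(html, 'html.parser')

# .select_one takes a CSS selector, so chained classes work directly
print(soup.select_one('div.matchDate.renderMatchDateContainer').text)
# Sat 13 Aug 2011
```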

A batteries included solution with selenium itself:

>>> driver.find_element_by_css_selector("div.matchDate.renderMatchDateContainer").text
'Sat 13 Aug 2011'

Upvotes: 1
