web scraping with beautifulsoup hiding elements

Question

I am trying to scrape the following url with BeautifulSoup: https://www.investopedia.com/markets/stocks/aapl/#Financials

I have tried to parse this section which i found with inspect:

     
          
          1.43

MyCode is as followed:

import bs4 as bs
import requests

def load_ticker_invest(ticker):
resp = requests.get('https://www.investopedia.com/markets/stocks/{}/#Financials'.format(ticker))
    soup = bs.BeautifulSoup(resp.text, 'html.parser')
    trend = soup.div.find_all('div', attrs={'class':'value'})

    return trend

print (load_ticker_invest('aapl'))

What I get as result is a blank list:

[]

How can I solve this?

Keyvan Tajbakhsh · Accepted Answer

import requests
from selenium import webdriver
from selenium.webdriver.common.desired_capabilities import DesiredCapabilities
import bs4 as bs

caps = DesiredCapabilities().CHROME
caps["pageLoadStrategy"] = "normal"
driver = webdriver.Chrome(desired_capabilities=caps)
driver.get('https://www.investopedia.com/markets/stocks/aapl/#Financials')
resp = driver.execute_script('return document.documentElement.outerHTML')
driver.quit()

soup = bs.BeautifulSoup(resp, 'html.parser')
res = soup.find('div', attrs={'class':'text position'}).text
print (res)

web scraping with beautifulsoup hiding elements

Answers (2)

Related Questions