unable to parse a webpage using python

Question

I am trying to parse below webpage to get name of stocks hitting now all time high or low in the exchange.

https://www.bseindia.com/markets/equity/EQReports/HighLow.html?Flag=H#

however, when i download the webpage using beautiful soup and check the data i do not find the stock name or price mentioned in the webpage. I wish to write a function to download the stocks that hit a new all time high each day please help what am i missing?

gmds · Accepted Answer

Part of the HTML on the page is generated dynamically by JavaScript. You are most likely using the requests library, which cannot handle HTML generated in this way.

What you can do, instead, is use the Selenium library, which allows you to launch an instance of a web browser controlled by Python, and get the page source from there.

from selenium import webdriver

path = '...' # path to driver here
url = 'https://www.bseindia.com/markets/equity/EQReports/HighLow.html?Flag=H#'

driver = webdriver.Chrome(path)
page_source = driver.get(url).page_source

By parsing page_source with BeautifulSoup, you can get what you want.

unable to parse a webpage using python

Answers (1)

Related Questions