Selenium: how to get the entire html as a string?

Question

I am using Selenium with python. See the following code:

from selenium.webdriver.common.keys import Keys
import selenium.webdriver
driver = selenium.webdriver.Firefox()
driver.get("http://finance.yahoo.com/q?s=APP")

Now, I want to do one simple thing: get the html of that web as a string from the driver. Then, I can use BeautifulSoup to parse it. Does anyone know this?

Actually, I don't how to access information from this driver, e.g., to get the stock price of apple in this case.

I am totally new to Selenium. A good tutorial for it is highly appreciated.

Thank you!

Gregory Kremler · Accepted Answer

You're looking for page_source.

To continue along with your example:

soup = BeautifulSoup(driver.page_source)

As another commenter noted however, you could use a library like requests to the same effect:

r = requests.get('http://finance.yahoo.com/q?s=APP')
soup = BeautifulSoup(r.content)

Selenium: how to get the entire html as a string?

Answers (2)

Related Questions