How to get status code in selenium chrome web driver in python

Question

I am looking for status_code in selenium but can't find any code that suits my need. My other problem is that when I enter a domain which does not exists lets say https://gghgjeggeg.com. Selenium does not raises any eror. It's page source is like:-

How can I get status code(for valid domains eg:https://twiitter.com/404errpage) as well as raise error for non existing domains in Selenium or is there any other library like Selenium?

try-catch-finally · Accepted Answer

Selenium is not meant to be used to directly examine HTTP status codes. Selenium is used to interact with the website like a user would do. And the typical user would not open the developer tools and observe the HTTP status code but look at the page content.

I even saw pages responding with a HTTP 200 OK delivering a "resource not found" message to the user.

Even the Selenium developers addressed this:

The browser will always represent the HTTP status code, imagine for example a 404 or a 500 error page. A simple way to “fail fast” when you encounter one of these error pages is to check the page title or content of a reliable point (e.g. the
tag) after every page load.

Source: selenium.dev / Worst practices / HTTP response codes

If you insist using Selenium you're better off finding the first h1 element and looking for the typical Chrome 404 signature:

h1 = driver.find_element_by_css_selector('h1')
if h1.text == u"This site can’t be reached":
    print("Not found")

Although, if you want to crawl websites, you might even use urllib, like Tek Nath suggested in the comments:

import urllib.request
import urllib.request
import urllib.error

try:
    with urllib.request.urlopen('http://www.safasdfsadfsadfdsf.org/') as f:
        print(f.read())
        print(f.status)
        print(f.getheader("content-length"))
except urllib.error.URLError as e:
    print(e.reason)

Since the domain is not existing, the code will run into the exception handler branch.

See the Python documentation for details and more examples:

You might then want to use a DOM parser to process the HTML markup to a DOM tree for easier processing. Though this is beyond this question - get started here:

xml.dom (Python documentation)
"Python: Is there a built in package to parse html into dom" (Stackoverflow)

How to get status code in selenium chrome web driver in python

Answers (2)

Related Questions