Python Requests gets different HTML data than Browser; JS seems irrelevant

Question

I'm trying to scrape the weather data from this website:

http://www.fastweather.com/yesterday.php?city=St.+Louis_MO

The problem I have run into is Yesterday's Precipitation. When viewed in the developer tools, I see the following:

Yesterday's Precipitation
was 0.13 inches

But when viewing it from Python, either using Requests or the urllib modules, I see this:

Yesterday\'s Precipitation
was T inches

I use NoScript in my browser, and I disallowed all JavaScript from running, but the 0.13 still appears. Where is this number coming from, and how do I obtain it with Python?

I'm on a Unix system, and this will be a daily script to run. I would like to avoid Selenium, if possible.

Even if there are other websites to use, I would like to know why that mysterious T exists.

Here's my relevant code:

webpage = requests.get("http://www.fastweather.com/yesterday.php?city=St.+Louis_MO")
if webpage.status_code == 200:
    content = str(webpage.content)

I have also tried this:

with requests.Session() as session:
    webpage = session.get("http://www.fastweather.com/yesterday.php?city=St.+Louis_MO")
    content = webpage.text

And this:

webpage = urllib.request.urlopen("http://www.fastweather.com/yesterday.php?city=St.+Louis_MO")
content = webpage.read()

(There may be minor mistakes in the above code since I can't remember exactly how each method works.)

Python Requests gets different HTML data than Browser; JS seems irrelevant

Answers (1)

Related Questions