Reputation: 1
I have the following url: https://tenhou.net/3/mjlog2xml.cgi?2009042400gm-00b9-0000-3a2a55dc
The page simply contains text, and I want to download it and save it to disk as an XML file using Python. I'm using the requests module. Here is what I've tried:
import requests
url = "https://tenhou.net/3/mjlog2xml.cgi?2009042400gm-00b9-0000-3a2a55dc"
r = requests.get(url, allow_redirects=True)
open('test.xml', 'wb').write(r.content)
When I inspect the contents of test.xml, it only contains the text "PLEASE DOWNLOAD RAW FILE". I've also tried using urllib.request.urlopen(), but I get the same result.
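For reference, the urllib attempt looked roughly like this (a sketch of the plain urlopen pattern, with no custom headers):
import urllib.request

url = "https://tenhou.net/3/mjlog2xml.cgi?2009042400gm-00b9-0000-3a2a55dc"
# Fetch the URL and write the raw response bytes to disk
with urllib.request.urlopen(url) as resp:
    data = resp.read()
with open('test.xml', 'wb') as f:
    f.write(data)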
However, when I open the URL in a browser, I see the full markup, and I can even save the page as an XML file.
The HTML that I receive, using the requests method, is:
<html>
<body>
<p>PLEASE DOWNLOAD RAW FILE</p>
</body>
</html>
But the HTML on the site is different. [Screenshot: the raw text I want to download is shown on the left; the page's HTML is shown on the right.]
If I can just get the HTML that's on the right, then I know how to use something like BeautifulSoup to parse it and get what I want. But I'm not sure why python-requests and urllib are not giving me the right data.
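For reference, the parsing step I have in mind looks roughly like this (a sketch only; it just dumps every tag, since the exact tag names depend on the mjlog format):
from bs4 import BeautifulSoup

# Parse the downloaded file; the 'xml' parser requires lxml to be installed
with open('test.xml', encoding='utf-8') as f:
    soup = BeautifulSoup(f.read(), 'xml')

# Print every tag name and its attributes to explore the structure
for tag in soup.find_all(True):
    print(tag.name, tag.attrs)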
Upvotes: 0
Views: 625
Reputation: 311163
That site seems to check the User-Agent header sent in the request.
If you explicitly set a browser-like User-Agent in your request, you'll get the response you're trying to get:
import requests
url = "https://tenhou.net/3/mjlog2xml.cgi?2009042400gm-00b9-0000-3a2a55dc"
# Create a dictionary of the headers including the User-Agent
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.169 Safari/537.36'
}
r = requests.get(url, headers=headers, allow_redirects=True)

# Save the response body to disk; the with-block closes the file cleanly
with open('test.xml', 'wb') as f:
    f.write(r.content)
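Since you also tried urllib, the same header trick works there too by attaching the headers to a Request object. A minimal sketch, reusing the User-Agent string above:
import urllib.request

url = "https://tenhou.net/3/mjlog2xml.cgi?2009042400gm-00b9-0000-3a2a55dc"
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.169 Safari/537.36'
}
# Attach the browser-like User-Agent via a Request object
req = urllib.request.Request(url, headers=headers)
with urllib.request.urlopen(req) as resp:
    with open('test.xml', 'wb') as f:
        f.write(resp.read())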
Upvotes: 1