Python requests text only returning ï»¿ï»¿ instead of HTML

Question

I'm trying to scrape a website for the link to a file that I want to download later.

My code:

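import requests

outage_page = 'https://www.oasis.oati.com/cgi-bin/webplus.dll?script=/woa/woa-planned-outages-report.html&Provider=MISO'

# Use a session so cookies and connection settings persist across requests
s = requests.Session()

# verify points at the CA bundle / certificate file used for TLS verification
req = s.get(outage_page, stream=True, verify='my cert path is here')

# Dump the response object, headers, raw stream, detected encoding, bytes, and decoded text
print(req, ' ', req.headers, ' ', req.raw, ' ', req.encoding, ' ', req.content, ' ', req.text)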

This is the output I get:

{'Content-Type': 'text/html', 'Content-Encoding': 'gzip', 'Vary': 'Accept-Encoding', 'Server': 'Microsoft-IIS/7.5', 'X-Powered-By': 'ASP.NET', 'X-Content-Type-Options': 'nosniff', 'Strict-Transport-Security': 'max-age=31536000; includeSubDomains', 'Date': 'Mon, 26 Aug 2019 15:48:39 GMT', 'Content-Length': '136'}

ISO-8859-1

b'\xef\xbb\xbf\xef\xbb\xbf '

ï»¿ï»¿

Process finished with exit code 0

I expected req.text to return the HTML so I could scrape it, but it only returns ï»¿ï»¿. The other print statements are just for reference. What am I doing wrong?
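
For reference, the bytes shown above are two UTF-8 byte-order marks (EF BB BF) followed by a space, and ï»¿ is how a BOM looks when it is decoded as ISO-8859-1. The sketch below is only an illustration built from the bytes printed in the output, not a fix: it assumes the printed req.content is the complete body, and the variable names are made up for the example. As far as I know, requests falls back to ISO-8859-1 when a text/* Content-Type header carries no charset, which would match the encoding printed here.

```python
# The raw body printed in the question: two UTF-8 byte-order marks and a space.
raw = b'\xef\xbb\xbf\xef\xbb\xbf '

# Decoding as ISO-8859-1 maps each BOM byte to its own Latin-1 character,
# which is where the visible "ï»¿" sequence comes from.
as_latin1 = raw.decode('ISO-8859-1')
print(repr(as_latin1))

# Decoding as UTF-8 turns each 3-byte BOM into the single code point U+FEFF.
as_utf8 = raw.decode('utf-8')
print(repr(as_utf8))

# 'utf-8-sig' strips only the first leading BOM, so remove the rest explicitly,
# then drop the surrounding whitespace.
cleaned = raw.decode('utf-8-sig').lstrip('\ufeff').strip()
print(repr(cleaned))
```

If that assumption holds, nothing is left once the BOMs and the space are removed, which suggests the server sent an essentially empty page rather than HTML that was decoded incorrectly. Setting req.encoding to a different value would then change how the BOMs are displayed but would probably not recover any markup.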
