Unable to read the page with requests

Question

I want to read the page https://www1.hkexnews.hk/listedco/listconews/index/lci.html?lang=zh. Here is my code:

import requests
from bs4 import BeautifulSoup
headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_5) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/50.0.2661.102 Safari/537.36'}
url = 'https://www1.hkexnews.hk/listedco/listconews/index/lci.html?lang=zh'
res = requests.get(url, headers = headers)
res.encoding = 'utf-8-sig'
soup = BeautifulSoup(res.text, 'lxml')

However, res.text contains no data of the page.

I also tried:

from requests_html import HTMLSession
session = HTMLSession()
r = session.get(url)
r.html.render()

It says: pyppeteer.errors.NetworkError: Protocol error Target.closeTarget: Target closed.

What should I do?

Imperishable Night · Accepted Answer

https://www1.hkexnews.hk/ncms/json/eds/lcisehk1relsdc_1.json

You are welcome.

In case you are curious, the "Network" tab of the DevTools is your friend.

Unable to read the page with requests

Answers (2)

Related Questions