Python convert HTML table to json

Question

I have an HTML table that I want to convert to JSON. I tried pandas.read_html and BeautifulSoup without success. It is really frustrating, please help!

Here is my original Python code:

import json

import requests
from bs4 import BeautifulSoup

url = 'http://financials.morningstar.com/ajax/keystatsAjax.html?t=wja&culture=en-CA&region=CAN'
lm_json = requests.get(url).json()
ksContent = BeautifulSoup(lm_json["ksContent"],"html.parser")
table = ksContent.find("table", {'class': "r_table1 text2"})
jsonD = json.dumps(table.text)
jsonL = json.loads(jsonD)

The variable 'table' holds the HTML table, but the JSON conversion turns it into plain text.

Shane Fontaine · Accepted Answer

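jsonD = json.dumps(htmlContent.text) (table.text in the code above) serializes the table's text content, with the tags already stripped, into a JSON string literal. jsonL = json.loads(jsonD) parses that literal back into a regular string/unicode object. The round trip is a no-op: any escaping done by dumps() is reverted by loads(), so jsonL contains the same data as htmlContent.text. To get structured JSON, the rows and cells have to be extracted from the table before anything is serialized.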
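
One possible way to get structured JSON is to collect the cell text row by row, use the first row as keys, and only then call json.dumps(). The following is a minimal sketch, not a definitive implementation. To stay dependency-free it swaps in the standard-library html.parser module instead of BeautifulSoup. TableParser, table_to_records and sample_html are hypothetical names introduced for illustration, and the sample markup is made up rather than taken from the Morningstar response. It assumes a single, non-nested table whose first row holds the column headers.

```python
import json
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every <th>/<td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("th", "td") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside an open cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:  # skip rows that contained no cells
                self.rows.append(self._row)
            self._row = None
            self._cell = None


def table_to_records(html):
    """Return one dict per data row, keyed by the first row's cell texts."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


# Hypothetical sample markup, not the real Morningstar response.
sample_html = """
<table class="r_table1 text2">
  <tr><th>Metric</th><th>2014</th><th>2015</th></tr>
  <tr><td>Revenue</td><td>100</td><td>120</td></tr>
  <tr><td>Net income</td><td>10</td><td>15</td></tr>
</table>
"""

records = table_to_records(sample_html)
json_text = json.dumps(records, indent=2)  # a JSON array of objects
print(json_text)
```

With the code from the question, passing str(table) to table_to_records() should work the same way, since str() on a BeautifulSoup tag returns its markup. Real tables with merged cells or several header rows would need extra handling.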
