Python read xml file with Chinese characters

Question

This is my example XML file:

<root>
  <設立案號>066143470</設立案號>
  <登記編號>4927872</登記編號>
  <工廠名稱>公司名稱</工廠名稱>
  <工廠地址>工廠地址</工廠地址>
</root>

The problem appears after I read it into BeautifulSoup:

from bs4 import BeautifulSoup

soup = BeautifulSoup(open("info.xml"), features="lxml")
page = soup.html.root
print(page.prettify())

The result I got is:

066143470\u8a2d\u7acb\u6848\u865f>4927872\u767b\u8a18\u7de8\u865f>\u516c\u53f8\u540d\u7a31\u5de5\u5ee0\u540d\u7a31>\u5de5\u5ee0\u5730\u5740\u5de5\u5ee0\u5730\u5740>

Basically, the structure of the file is completely mangled and the Chinese characters come out as \uXXXX escapes. How can I read the file so that all the Chinese characters and the structure are preserved?

Thanks in advance.

Answers (1)
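
A minimal sketch of one way to approach this, assuming the file is saved as UTF-8 and is well-formed XML with a single root element. It uses the standard library's xml.etree.ElementTree instead of BeautifulSoup, and the helper names read_xml_fields and xml_to_text are made up for illustration:

```python
import xml.etree.ElementTree as ET


def read_xml_fields(path):
    """Return {tag: text} for the direct children of the root element.

    Hypothetical helper: assumes a flat document like the example above.
    """
    # ET.parse opens the file itself and decodes it according to the XML
    # declaration, falling back to UTF-8 when no declaration is present.
    root = ET.parse(path).getroot()
    return {child.tag: child.text for child in root}


def xml_to_text(path):
    """Serialize the parsed tree back to a str, keeping non-ASCII characters."""
    root = ET.parse(path).getroot()
    # encoding="unicode" makes tostring return a str rather than bytes, so
    # Chinese tag names and values are not turned into character references.
    return ET.tostring(root, encoding="unicode")
```

Calling read_xml_fields("info.xml") on a file like the one in the question should give a dict keyed by the Chinese tag names. If you would rather stay with BeautifulSoup, note that features="lxml" selects lxml's HTML parser, while features="xml" selects its XML parser; opening the file with an explicit encoding, e.g. open("info.xml", encoding="utf-8"), also avoids depending on the platform default.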