Python 3: Unable to convert XML to dict using xmltodict

Question

I am trying to convert data from an XML file to python dict, but am unable to do so. Following is the code I'm writing.

import xmltodict
input_xml  = 'data.xml'  # This is the source file

with open(input_xml, encoding='utf-8', errors='ignore') as _file:
    data = _file.read()
    data = xmltodict.parse(data,'ASCII')
    print(data)
    exit()

On executing this code, following is the error I'm getting:
xml.parsers.expat.ExpatError: not well-formed (invalid token): line 239, column 40.
After multiple hits and trials, I realized that my xml has some characters in Hindi language, inside a particular tag, as shown below

!! आप की सेवा में पुनः पधारे !!

How I can ignore these unencoded characters before running xmltodict.parse?

Python 3: Unable to convert XML to dict using xmltodict

Answers (1)

Related Questions