Python - Parse XML with repeated tags using ElementTree

Question

I have the following XML content:



    Version1
    Sub Version2
    
        1
        
            ID1
            NameFrank
        
        2
        
            ID2
            NameRichard
        
        3
        
            ID3
            NameSophia
        
    
    Persons
    
        
            NamePersons
            Descriptionempty

I'm having a hard time retrieving the names because the XML tag names are all the same and the tags have no attributes. So far I've tried iterating over the second-level dict, but I can't retrieve just the values I want.

What I got:

from xml.etree import ElementTree as et

tree = et.parse("file.xml")
root = tree.getroot()

for i in root.find('dict').find('dict').iter('dict'):
    print([j.text for j in i])

The output I want:

Frank
Richard
Sophia

Does anyone know how to access these values with such tags?

Jack Fleeting · Accepted Answer

Try it using lxml instead:

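from lxml import etree

plist = """your xml above"""

doc = etree.fromstring(plist)
# Select each <key> whose text is exactly "Name" (XPath is case-sensitive),
# then take the <string> element that immediately follows it.
doc.xpath('//dict/dict/key[.="Name"]/following-sibling::string[1]/text()')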

output:

['Frank', 'Richard', 'Sophia']
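
Since the question asks about ElementTree specifically, the same idea can probably be sketched with the standard library alone. The snippet below is only an illustration under assumptions: the sample document is a hypothetical reconstruction of a plist-style layout (alternating key and value elements inside dict), the "Records" key and the helper name values_for_key are made up for this example, and the real file may differ.

```python
from xml.etree import ElementTree as ET

# Hypothetical sample: a plist-like layout where each <key> is followed by its value.
# The "Records" key name is an assumption made for this sketch.
SAMPLE = """
<plist>
  <dict>
    <key>Version</key><integer>1</integer>
    <key>Records</key>
    <dict>
      <key>1</key>
      <dict>
        <key>ID</key><integer>1</integer>
        <key>Name</key><string>Frank</string>
      </dict>
      <key>2</key>
      <dict>
        <key>ID</key><integer>2</integer>
        <key>Name</key><string>Richard</string>
      </dict>
      <key>3</key>
      <dict>
        <key>ID</key><integer>3</integer>
        <key>Name</key><string>Sophia</string>
      </dict>
    </dict>
  </dict>
</plist>
"""


def values_for_key(root, wanted):
    """Collect the text of every <string> that directly follows <key>wanted</key>."""
    found = []
    for d in root.iter("dict"):
        children = list(d)
        # Walk adjacent sibling pairs: (key, value), (value, next key), ...
        for k, v in zip(children, children[1:]):
            if k.tag == "key" and k.text == wanted and v.tag == "string":
                found.append(v.text)
    return found


root = ET.fromstring(SAMPLE)
names = values_for_key(root, "Name")
print(names)
```

Because this walks every dict in the document, any other dict that also has a Name key with a string value would be included too. In that case, starting the iteration from a narrower element (for example the dict that holds the numbered records) should limit the matches.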
