Getting elementtree tag texts by partial tag names

Question

In an XML document, I have an element with a DateTime tag, which can be extracted using:

for elem in xml_tree_root.iter(tag='DateTime'):
    print(elem.text)

In another version of the same XML file, the tag's name is blahblooDateTimebloobli, so I need something like:

for elem in xml_tree_root.iter(tag='*DateTime*'):
    print(elem.text)

that would work for both versions of the XML. But the wildcard version matches nothing. Passing only '*' does match every element, though, which suggests some kind of pattern matching should be possible. My question: is it possible to pass a regular expression to ElementTree's iter()?

Wiktor Stribiżew · Accepted Answer

It looks as if you simply want to get the text of any elements whose tag name contains the DateTime substring.

In this case, you can use

values = [e.text for e in xml_tree_root.iter('*') if 'DateTime' in e.tag]
print(values)

That is, iterate over all the elements and, if the tag name contains DateTime, collect the element's text.
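Since the question asks about regular expressions, here is a minimal self-contained sketch of the same filtering idea, shown once with a substring test and once with re.search on the tag name. The sample XML and the names xml_text, pattern, by_substring and by_regex are made up for illustration; only the two tag names come from the question. iter() compares tag names literally (apart from the special value '*'), so the pattern matching has to happen in your own code.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample document containing both tag-name variants from the question.
xml_text = """
<root>
    <DateTime>2020-01-01T00:00:00</DateTime>
    <group>
        <blahblooDateTimebloobli>2021-06-15T12:30:00</blahblooDateTimebloobli>
        <Other>ignored</Other>
    </group>
</root>
"""
xml_tree_root = ET.fromstring(xml_text)

# Substring test: iterate over every element and keep those whose tag contains 'DateTime'.
by_substring = [e.text for e in xml_tree_root.iter() if 'DateTime' in e.tag]

# Regex test: the same loop, but the tag name is checked with a compiled pattern.
pattern = re.compile(r'DateTime')
by_regex = [e.text for e in xml_tree_root.iter() if pattern.search(e.tag)]

print(by_substring)  # ['2020-01-01T00:00:00', '2021-06-15T12:30:00']
print(by_regex)      # same result for this pattern
```

Note that for namespaced documents e.tag includes the namespace in braces, e.g. '{uri}DateTime', so a substring or search-style regex still matches, while an anchored pattern would need to account for the prefix.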

