Getting XML attributes from XML with namespaces and Python (lxml)

Question

I'm trying to grab the "id" and "href" attributes from the below XML. Thus far I can't seem to get my head around the namespacing aspects. I can get things easily enough with XML that doesn't have namespace references. But this has befuddled me. Any ideas would be appreciated!

Laurent LAPORTE · Accepted Answer

You can use xpath function to search all resources and iterate on them. The function has a namespaces keyword argument. The can use it to declare the mapping between namespace prefixes and namespace URL.

Here is the idea:

from lxml import etree

NS = {
    "ns5": "ers.ise.cisco.com",
    "ns3": "v2.ers.ise.cisco.com"
}

tree = etree.parse('your.xml')

resources = tree.xpath('//ns5:resource', namespaces=NS)

for resource in resources:
    print(resource.attrib['id'])
    links = resource.xpath('link')
    for link in links:
        print(link.attrib['href'])

sorry, this is not tested

Here is the documentation about xpath.

Getting XML attributes from XML with namespaces and Python (lxml)

Answers (2)

Related Questions