XML scanning for value

Question

I'm getting an XML document with the following structure from an API:


    2397
    action_alert
    
        action_alert
        2

I am scanning for the ID by doing the following -

sourceobject = etree.parse(urllib2.urlopen(fullsourceurl))
source_id = sourceobject.xpath('//id/text()')[0]

I also want to get the tes:type value, so I tried this:

source_type = sourceobject.xpath('//tes:actions/tes:type/text()')[0]

This doesn't work. It gives the following error:

lxml.etree.XPathEvalError: Undefined namespace prefix

How do I get it to ignore the namespace?

Alternatively, I know the namespace which is this -

har07 · Accepted Answer

The proper way to access nodes in a namespace is to pass a mapping from prefix to namespace URI as the namespaces argument of the xpath() method, for example:

ns = {'tes' : 'http://www.blah.com/client/servlet'}
source_type = sourceobject.xpath('//tes:actions/tes:type/text()', namespaces=ns)[0]
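
To try this without calling the API, here is a minimal self-contained sketch. The sample document is an assumption: the XML in the question lost its tags, so the element layout below (including the name and tes:count elements) is hypothetical and only illustrates the lookup.

```python
from lxml import etree

# Hypothetical sample; the real API response may be laid out differently.
SAMPLE = b"""<source xmlns:tes="http://www.blah.com/client/servlet">
  <id>2397</id>
  <name>action_alert</name>
  <tes:actions>
    <tes:type>action_alert</tes:type>
    <tes:count>2</tes:count>
  </tes:actions>
</source>"""

root = etree.fromstring(SAMPLE)

# Map the prefix used in the XPath expression to the namespace URI.
ns = {"tes": "http://www.blah.com/client/servlet"}

# xpath() returns a list, so guard against an empty result before indexing.
matches = root.xpath("//tes:actions/tes:type/text()", namespaces=ns)
source_type = matches[0] if matches else None

# Elements outside any namespace need no mapping.
source_id = root.xpath("//id/text()")[0]

print(source_id, source_type)
```

The prefix in the mapping only has to match the prefix used in the XPath expression. What is compared against the document is the URI, so the document itself could use a different prefix for the same namespace.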

Another, less recommended way is to ignore namespaces altogether by using the XPath function local-name():

source_type = sourceobject.xpath('//*[local-name()="actions"]/*[local-name()="type"]/text()')[0]
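
One reason the local-name() approach is less recommended is that it matches elements from every namespace that happen to share a local name. The sketch below illustrates this with a hypothetical document; the second namespace (http://example.com/other) and its contents are invented purely for the demonstration.

```python
from lxml import etree

# Hypothetical document with two namespaces that both define actions/type.
SAMPLE = b"""<source xmlns:tes="http://www.blah.com/client/servlet"
        xmlns:other="http://example.com/other">
  <tes:actions><tes:type>action_alert</tes:type></tes:actions>
  <other:actions><other:type>unrelated</other:type></other:actions>
</source>"""

root = etree.fromstring(SAMPLE)

# Namespace-agnostic query: matches both tes:type and other:type.
loose = root.xpath('//*[local-name()="actions"]/*[local-name()="type"]/text()')

# Namespace-aware query: matches only the element in the tes namespace.
ns = {"tes": "http://www.blah.com/client/servlet"}
strict = root.xpath("//tes:actions/tes:type/text()", namespaces=ns)

print(loose)
print(strict)
```

If the document only ever contains one namespace, both queries return the same result, but the namespace-aware form keeps working if another vocabulary is mixed in later.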
