Parsing amazon xml in python

Question

I Got following XML from Amazon Web-Service.


    
        
            8789797
        
        
            
                
                    google.com/
                    1
                
            
        
        
            Success

When i try to extract the rank.

xmldoc = minidom.parse(response)
itemlist = xmldoc.getElementsByTagName('aws:Rank')[0]

xmlData=itemlist.replace('','').replace('','')
print xmlData

It give me error.

AttributeError: Element instance has no attribute 'replace'

nukleas · Accepted Answer

The issue here is that you are trying to use replace on an XML element, which is not a list and isn't a string, which would have the .replace().

Since you are picking out the element (which is an Element object) by using =getElementsByTagName('aws:Rank')[0], you only have one thing to work on.

the data that you want can be reached with:

itemlist.firstChild.data

or

itemlist.firstChild.nodeValue

(@root, you had this right, I don't know why you got downvoted)

Now I had some trouble parsing that XML because the namespace wasnt bound, but that wasn't a biggie.

what would likely be clearer is the snippet as such:

xmldoc = minidom.parse(response)
xmlElement = xmldoc.getElementsByTagName('aws:Rank')[0]
xmlData = xmlElement.firstChild.nodeValue
print xmlData

But in all honesty, you will probably want to check out the info on the Element object in minidom:

http://docs.python.org/2/library/xml.dom.html#dom-element-objects

Parsing amazon xml in python

Answers (1)

Related Questions