Extract text value in xml using python

Question

I using ElementTree to try and extract a number of values from an XML.

Here is a sample of the xml:-




76561197960964581
Team Fortress 2

    
        Scout.accum.iNumberOfKills
        1777
    
    
        Scout.accum.iPlayTime
        247469
    
    
        Scout.accum.iPointCaptures
        641
    
    
        Soldier.accum.iNumberOfKills
        1270
    
    
        Soldier.accum.iPlayTime
        94649
    
    
        Spy.accum.iNumberOfKills
        7489
    
    
        Spy.accum.iPlayTime
        1110582

There is a lot more, but this is just a sample.

I want to extract and sum all the values relating to "*.accum.iPlayTime", to calculate a total play time. Star implying all classes (e.g. scout, soldier, etc).

My code so far (including some of my attempts):-

playerStatsKISA = urllib2.urlopen('http://api.steampowered.com/ISteamUserStats/GetUserStatsForGame/v0002/?appid=440&key=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx&steamid=xxxxxxxxxxxxxxxxx&format=xml')
statsTF2 = playerStatsTF2.read()
theStatsTF2 = ET.fromstring(statsTF2)

totalTimesTF2 = theStatsKISA.findtext("Scout.accum.iPlayTime") # Didn't find anything
print totalTimesKISA

totalTimesTF2 = theStatsKISA.findall("./stats/stat/name") 
for indiv in totalTimesTF2: # Another failed attempt
    print indiv.attrib # didn't extract anything, I gather because the text I'm after is not an attribute but a value?
    if indiv.attrib == 'Scout.accum.iPlayTime':
        print "got it" # would extract value here, but it would be long winded to do this then try and extract the next value I'm actually after.

I was going with the idea of acquiring the value from each class then summing it. Though I gather there is probably a away to get all the value in one foul swoop using a * for the TF2 class name, but I was going to do that after I first worked out how to get the value from the tag following one that contained the value I needed.

Hope this made sense.

Thanks.

falsetru · Accepted Answer

Use text attribute:

root = ET.fromstring(statsTF2)
for stat in root.findall("./stats/stat"):
    if stat.find('name').text.endswith('.accum.iPlayTime'):
        print stat.find('value').text

prints (given the xml in the question):

247469
94649
1110582

Using lxml with XPath:

import lxml.etree as ET

root = ET.fromstring(statsTF2)
for text in root.xpath('./stats/stat[name[contains(text(), ".accum.iPlayTime")]]/value/text()'):
    print text

Extract text value in xml using python

Answers (2)

Related Questions