Parsing specific field in XML file in Python

Question

I have an XML file that looks like this:



  DailyTreasuryYieldCurveRateData
  http://data.treasury.gov:8001/feed.svc/DailyTreasuryYieldCurveRateData
  2015-08-30T15:17:09Z
  
  
    http://data.treasury.gov:8001/Feed.svc/DailyTreasuryYieldCurveRateData(6404)
    
    2015-08-30T15:17:09Z
    
      
    
    
    
    
      
        6404
        2015-08-03T00:00:00
        0.03
        0.08
        0.17
        0.33
        0.68
        0.99
        1.52
        1.89
        2.16
        2.55
        2.86
        2.86
      
    
  
  
    http://data.treasury.gov:8001/Feed.svc/DailyTreasuryYieldCurveRateData(6405)
    
    2015-08-30T15:17:09Z
    
      
    
    
    
    
      
        6405
        2015-08-04T00:00:00
        0.05
        0.08
        0.18
        0.37
        0.74
        1.08
        1.6
        1.97
        2.23
        2.59
        2.9
        2.9

How can I parse out the '2.16' for 'BC_10YEAR'? I've been looking at other examples that use ElementTree and lxml, but I can't match the XML format in those examples to that of my file.

The last thing I've tried was:

from lxml import etree
doc = etree.parse(yield_xml)
memoryElem = doc.find('content')
print memoryElem.text        # element text
print memoryElem.get('type') # attribute

I get an error: AttributeError: 'NoneType' object has no attribute 'text'

Is there a simple way to do this?

mmachine · Accepted Answer

You can try the built-in str.split method:

>>>[data.split('>')[1].split('<')[0] for data in str(xml_file).split('
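As an alternative to string splitting, here is a minimal sketch using the standard library's ElementTree. It assumes the file is a typical OData Atom feed: the namespace URIs, the SAMPLE document, the NEW_DATE tag, and the helper name ten_year_yields are all assumptions for illustration, since the tags are not visible in the question; only BC_10YEAR and the content element come from it. If the elements are namespaced like this, that would also explain the AttributeError: find('content') looks for an un-namespaced direct child of the root and returns None.

```python
import xml.etree.ElementTree as ET

# Assumed namespaces for an OData Atom feed; check them against the
# xmlns declarations at the top of the actual file.
NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "m": "http://schemas.microsoft.com/ado/2007/08/dataservices/metadata",
    "d": "http://schemas.microsoft.com/ado/2007/08/dataservices",
}

# Hypothetical, trimmed-down stand-in for the real file.
SAMPLE = """<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:d="http://schemas.microsoft.com/ado/2007/08/dataservices"
      xmlns:m="http://schemas.microsoft.com/ado/2007/08/dataservices/metadata">
  <entry>
    <content type="application/xml">
      <m:properties>
        <d:NEW_DATE>2015-08-03T00:00:00</d:NEW_DATE>
        <d:BC_10YEAR>2.16</d:BC_10YEAR>
      </m:properties>
    </content>
  </entry>
  <entry>
    <content type="application/xml">
      <m:properties>
        <d:NEW_DATE>2015-08-04T00:00:00</d:NEW_DATE>
        <d:BC_10YEAR>2.23</d:BC_10YEAR>
      </m:properties>
    </content>
  </entry>
</feed>"""


def ten_year_yields(xml_text):
    """Return {date: 10-year yield} for every entry in the feed."""
    root = ET.fromstring(xml_text)
    yields = {}
    # Each tag must be qualified with its namespace prefix.
    for entry in root.findall("atom:entry", NS):
        props = entry.find("atom:content/m:properties", NS)
        if props is None:
            continue  # entry without a properties block
        date = props.findtext("d:NEW_DATE", namespaces=NS)
        rate = props.findtext("d:BC_10YEAR", namespaces=NS)
        if date is not None and rate is not None:
            yields[date[:10]] = float(rate)  # keep only YYYY-MM-DD
    return yields


print(ten_year_yields(SAMPLE))
```

To run it on a file instead of a string, ET.parse(path).getroot() can replace ET.fromstring(xml_text). The same namespace-qualified paths should also work with lxml's find and findall, which accept a namespaces mapping.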
