Unable to parse XML response and obtain elements

Question

This is my XML response from a http request




    
        
            US National Weather Service, National Centres for Environmental Prediction (NCEP)
        
        
            0
        
        
            2,1
        
        
            Forecast
        
        
            Analysis from GDAS (Global Data Assimilation System)
        
        
            GRIB-2
        
        
            CF-1.6
        
        
            Read using CDM IOSP GribCollection v3
        
        
            GRID
        
        
            ucar.nc2.dataset.conv.CF1Convention
        
    

    
        
            Hour since 2007-12-06T12:00:00Z
        
        
            time
        
        
            GRIB forecast or observation time
        
        
            proleptic_gregorian
        
        
            Time

I am trying to parse this XML content using Python 3.5

from xml.etree import ElementTree

response = requests.get("http://rda.ucar.edu/thredds/dodsC/aggregations/g/ds083.2/2/TP.ddx?time1")

tree = ElementTree.fromstring(response.content)

attr = tree.find("Attribute")
print(attr)

When I print this I get a None. What am I doing wrong? I also want to access the "Array" tag but that also returns None.

mhawke · Accepted Answer

The XML document uses namespaces so you need to support that in your code. There is an explanation and example code in the etree documentation.

Basically you can do this:

import requests
from xml.etree import ElementTree

response = requests.get('http://rda.ucar.edu/thredds/dodsC/aggregations/g/ds083.2/2/TP.ddx?time1')

tree = ElementTree.fromstring(response.content)

attr = tree.find("{http://xml.opendap.org/ns/DAP2}Attribute")

>>> print(attr)


# or declare the namespace like this
ns = {'dap2': 'http://xml.opendap.org/ns/DAP2'}
attr = tree.find("dap2:Attribute", ns)

>>> print(attr)

Unable to parse XML response and obtain elements

Answers (2)

Related Questions