Parse XML to CSV using Python nonetype error

Question

I am trying to parse XML file to CSV. However, I am getting the following error. I have tested the logic with another simple XML and it seems to work. I have provided below my error, the XML file, the python code, and my desired output. Right now I have only added two of my columns. Have been looking at this for hours so another set of eyes would be much appreciated. Thank you!

Error:

name = member.find('CaseName').tag AttributeError: 'NoneType' object has no attribute 'tag'

XML File:

 

  

    

      
      

      NATIVE
      C:\Users\KK132WQ\Desktop\Brooklyn Case - Nuix\OCR cache directory
      false
      false
      false
      false
      false
      false
      position

      Brooklyn
      C:\Users\KK132WQ\Desktop\Brooklyn Case - Nuix

      America/Chicago


      
        Document ID numbering
        true
        false
        DOC-000000001
      

      
        Default
      

      
        Page only
        Page only
      
      
          High Quality - Slow
          
          
          Auto
          English
      

      0.85

    

    
      4
      0
      4
      0
      
        
        
      
    

    
      

        
        

        
        
      
      

        
        

        
        
      
      

        
        

        
        
      
      

        
        

        
        
      
      

        
        

        
        
      
      

        
        

        
        
      
      

        
        

        
        
      
      

        
        

        
        
      
      

        
        

        
        
      
    

    
      3
      0
      0
      3
      0
      0
      0
      0
      0
      0
      0
      0
      0

      
        0
        0
        0
        0
        0
        0
        0
        0
        0
        0
      
    

    
      0
      0
      0
      0.0
    

    
      0.0857363321997085
      0.0
      0.0
      0.0
      0.0

Python Code:

    import xml.etree.ElementTree as ET
    import csv

    tree = ET.parse('D:\Users\eferse\Desktop\XML_parsing\summary-report.xml')
    root = tree.getroot()

    # open a file for writing

    Resident_data = open('D:\Users\eferse\Desktop\XML_parsing\Nuix Export XML Parse_PythonOutput.csv', 'w')

    # create the csv writer object

    csvwriter = csv.writer(Resident_data)
    resident_head = []

    count = 0
    for member in root.findall('Export'):
        resident = []
        address_list = []
        if count == 0:
            name = member.find('CaseName').tag
            resident_head.append(CaseName)
            location= member.find('CaseLocation').tag
            resident_head.append(CaseLocation)

            csvwriter.writerow(resident_head)
            count = count + 1

        name = member.find('CaseName').text
        resident.append(CaseName)
        location= member.find('CaseLocation').text
        resident.append(CaseLocation)


        csvwriter.writerow(resident)
    Resident_data.close()

Desired Output: Output

johnashu · Accepted Answer

I have used indexing to access the child elements in question. Sometimes this is easier to do when you know where the information is.

You can check this using the following

for child in root[0]:
    print(child.tag, child.attrib)

and you can navigate further by continuing the index as far as you like root[0][0][1] etc etc

You have to remember that the index is the parent and you are looking for the children. in your case root is Nuix which will return the children in this instance Export

root[0] is 'Export' which find will search the children and return what you want which is ExportConfiguration and inside here is what you are looking for CaseName and CaseLocation..

if you do

for child in root[0][0]:
    print(child.tag, child.attrib)

This will print the tags of CaseName etc but you will not be able to use find at this level. You will be searching inside CaseName for CaseName.

Once you have the parent you are able to find the children easier.

This code works.

I have taken the empty lists out of the loop.

I have also changed the append values as they did not have a variable, only a string name... I have also indented some appends as they were outside of the loop.

I have left the print statements in so you can see what is going on.

import xml.etree.ElementTree as ET
import csv

tree = ET.parse('summary-report.xml')
root = tree.getroot()

Resident_data = open('Parse_PythonOutput.csv', 'a')

    # create the csv writer object

csvwriter = csv.writer(Resident_data)
resident_head = []
resident = []
address_list = []

count = 0
for member in root[0]:
    if count == 0:

        name = member.find('CaseName').tag
        print(name)
        resident_head.append(name)

        location = member.find('CaseLocation').tag
        print(location)
        resident_head.append(location)

        csvwriter.writerow(resident_head)
        count = count + 1

        name_text = member.find('CaseName').text
        print(name_text)
        resident.append(name_text)

        text_location = member.find('CaseLocation').text
        print(text_location)
        resident.append(text_location)

        print(resident)

csvwriter.writerow(resident)

Resident_data.close()

The CSV data file looks like this:

CaseName,CaseLocation
Brooklyn,C:\Users\KK132WQ\Desktop\Brooklyn Case - Nuix

Parse XML to CSV using Python nonetype error

Answers (1)

Related Questions