dark horse
dark horse

Reputation: 447

Python - Error when trying to convert xml to csv

I have the below code that reads a xml file and tries to convert it to csv. The below works fine, however when the data has one additional sub-level it throws an error child index out of range

Given below is the data set I am trying to work with:

<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<Document>
  <Customer>
    <CustomerCode>ABC</CustomerCode>
    <CustomerName>ABC Co</CustomerName>
    <CustomerBusinessHours>
        <CustomerBusinessHoursTimeZoneOffset>1.000000</CustomerBusinessHoursTimeZoneOffset>
    </CustomerBusinessHours>
  </Customer>
</Document>

Code that I have tried building:

import xml.etree.ElementTree as ET
import csv


tree = ET.parse("/users/desktop/sample.xml")
root = tree.getroot()

# open a file for writing

Resident_data = open('/users/desktop/file.csv', 'w')

# create the csv writer object

csvwriter = csv.writer(Resident_data)
resident_head = []

count = 0
for member in root.findall('Customer'):
    resident = []
    address_list = []
    if count == 0:
        CustomerCode = member.find('CustomerCode').tag
        resident_head.append(CustomerCode)
        CustomerName = member.find('CustomerName').tag
        resident_head.append(CustomerName)
        CustomerBusinessHours = member[3].tag
        resident_head.append(CustomerBusinessHours)
        csvwriter.writerow(resident_head)
        count = count + 1

    CustomerCode = member.find('CustomerCode').text
    resident.append(CustomerCode)
    CustomerName = member.find('CustomerName').text
    resident.append(CustomerName)
    CustomerBusinessHours = member[3][1].text
    address_list.append(CustomerBusinessHours)
    CustomerBusinessHoursTimeZoneOffset = member[3][2].text
    address_list.append(CustomerBusinessHoursTimeZoneOffset)
    csvwriter.writerow(resident)
Resident_data.close()

I get the below error:

CustomerBusinessHours = member[3][1].text
IndexError: child index out of range

Expected output:

CustomerCode,CustomerName,CustomerBusinessHoursTimeZoneOffset
ABC,ABC Co,1.000000

Upvotes: 0

Views: 326

Answers (1)

balderman
balderman

Reputation: 23815

The code below is able to collect the data you are looking for.

import xml.etree.ElementTree as ET

xml = '''<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<Document>
  <Customer>
    <CustomerCode>ABC</CustomerCode>
    <CustomerName>ABC Co</CustomerName>
    <CustomerBusinessHours>
        <CustomerBusinessHoursTimeZoneOffset>1.000000</CustomerBusinessHoursTimeZoneOffset>
    </CustomerBusinessHours>
  </Customer>
</Document>'''

tree = ET.fromstring(xml)
for customer in tree.findall('Customer'):
    print(customer.find('CustomerCode').text)
    print(customer.find('CustomerName').text)
    print(customer.find('CustomerBusinessHours').find('CustomerBusinessHoursTimeZoneOffset').text)

Output

ABC
ABC Co
1.000000

Upvotes: 1

Related Questions