Converting pandas dataframe to XML

Question

I know this question has been asked before and my last was put on hold, so now I'm specifying it detailed. I have a CSV file of population information, I read it to pandas and now have to transform it to XML, for example like this
Akaa 2014 17052 ......

This is the reading part of my code:
import pandas as pd pop = pd.read_csv(r'''directory\population.csv''', delimiter=";")

Tried doing it like in mentioned before in the link here with function and cycle: How do convert a pandas/dataframe to XML?. Haven't succeeded, any other recommendations maybe?

This is an example of my dataframe:
Alahärmä 2014 0 0.1 0.2 0 Alajärvi 2014 10171 5102 5069 1 Alastaro 2014 0 0 0 2 Alavieska 2014 2687 1400 1287 3 Alavus 2014 12103 6102 6001 4 Anjalankoski 2014 0 0 0

Fairly new to python, so any help is apreciated.

Paula Livingstone · Accepted Answer

The question you have linked to actually has a great answer to your question but I guess you’re having difficulty transposing your data into that solution so Ive done it below for you.

Ok your level of detail is a bit sketchy. If your specific situation differs slightly then you'll need to tweak my answer but heres something that works for me:

First off assuming you have a text file as follows :

0       Alahärmä  2014      0   0.1   0.2
1      Alajärvi  2014  10171  5102  5069
2      Alastaro  2014      0     0     0
3     Alavieska  2014   2687  1400  1287
4        Alavus  2014  12103  6102  6001
5  Anjalankoski  2014      0     0     0

Moving on to creating the python script, we first import that text file using the following line:

pop = pd.read_csv(r'directory\population.csv', delimiter=r"\s+", names=['cityname', 'year', 'total', 'male', 'females'])

This brings in the text file as a dataframe and gives the new dataframe the correct column headers.

Then taking the data from the question you linked to, we add the following to our python script:

def func(row):
    xml = ['']
    for field in row.index:
        xml.append('  {1}'.format(field, row[field]))
    xml.append('')
    return '
'.join(xml)

print('
'.join(pop.apply(func, axis=1)))

Now we put it all together and we get the below:

import pandas as pd
pop = pd.read_csv(r'directory\population.csv', delimiter=r"\s+", names=['cityname', 'year', 'total', 'male', 'females'])

def func(row):
    xml = ['']
    for field in row.index:
        xml.append('  {1}'.format(field, row[field]))
    xml.append('')
    return '
'.join(xml)

print('
'.join(pop.apply(func, axis=1)))

When we run the above file we get the following output:


  Alahärmä
  2014
  0
  0.1
  0.2


  Alajärvi
  2014
  10171
  5102.0
  5069.0


  Alastaro
  2014
  0
  0.0
  0.0


  Alavieska
  2014
  2687
  1400.0
  1287.0


  Alavus
  2014
  12103
  6102.0
  6001.0


  Anjalankoski
  2014
  0
  0.0
  0.0

Converting pandas dataframe to XML

Answers (1)

Related Questions