Editing Items in an XML File in Python

Question

I'm trying to take data from a .csv file and create individual .xml files for each row. I've read the .csv into Pandas already. Where I'm struggling is trying to figure out how to make edits in .xml files.

I'm using this previous answer as a guide to try to learn this:

Link

Applying the author's solution to my data would look something like this:

data = """
    VOC2007
    abc.jpg
    

"""

Then I do this:

tree = et.fromstring(data)

Where I'm stuck is the next part. The author edits their file with this line of code:

for data in tree.findall("data"):
    name = data.attrib["name"]
    value = data.find("value")
    value.text = "[%s] %s" % (name, value.text)

I try to apply it to my own like this:

for data in tree.findall("data"):  
    filename = data.find("filename")
    filename.text = "001.jpg"

But this doesn't seem to change anything when I print it out.

print(et.tostring(tree))

What am I doing wrong or what steps do I need to take to edit the name of the image from 'abc.jpg' to '001.jpg'?

Also trying to figure out how to change the values for the four items xmin, ymin, xmax, and ymax.

Laurent LAPORTE · Accepted Answer

I make the assumption you read your CSV file and extract a collection of dictionary-like records, for instance:

record = {
    'folder': "VOC2007",
    'filename': "abc.jpg",
    'name': "blah",
    'pose': "unknown",
    'truncated': "0",
    'difficult': "0",
    'xmin': "0",
    'ymin': "0",
    'xmax': "0",
    'ymax': "0",
}

A simple thing you can do is to use a string template to generate your XML content (since it is very simple):

import textwrap

template = textwrap.dedent("""\

    {folder}
    {filename}
    
""")

To generate your XML content you can do:

from xml.sax.saxutils import escape

escaped = {k: escape(v) for k, v in record.items()}
data = template.format(**escaped)

The function xml.sax.saxutils.escape is used to convert “<“, “>” and “&” into XML entities.

The result is:


    VOC2007
    abc.jpg

Editing Items in an XML File in Python

Answers (2)

Related Questions