pHorseSpec

Reputation: 1274

Fix a CSV File's Format

So I'm opening a csv file to parse, but certain lines in the csv are formatted incorrectly. The csv format is typically the following throughout:

'ipAddress','associatedTix''\n'
'ipAddress','associatedTix''\n'
'ipAddress','associatedTix''\n'
'ipAddress','associatedTix''\n'
'ipAddress','associatedTix''\n'

but at certain points in the csv, when an ipAddress has more than one associatedTix, it gets formatted like the following:

'ipAddress','associatedTix''\n'
'ipAddress','associatedTix''\n'
'ipAddress','associatedTix''\n'
'associatedTix','associatedTix''\n'
'associatedTix''\n'
'ipAddress','associatedTix''\n'
'ipAddress','associatedTix''\n'

So what I was going to do to get the csv in the proper format was:

for line in inputCsvFile:
    chunks = line.split(",")
    if associatedTix in chunks[0]:
        # go through the following lines after that line until you find an ip address
        # go one line above the line with the ip address
        # push that column to the above row, and repeat until you get back to the original row with the ip address

The 3 commented lines are the ones I'm having trouble coming up with syntax for, so any help determining that syntax would be greatly appreciated. Also, confirmation that my logic will get the csv into the correct format would be appreciated as well.
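For reference, the kind of merging I have in mind looks roughly like the sketch below (completely untested; the IP_LINE regex and the merge_rows name are just placeholders I made up, not anything from the real file):

import re

# Sketch only: assume a "real" row starts with something that looks like an
# IPv4 address (possibly quoted and/or with a -NN suffix); continuation rows don't.
IP_LINE = re.compile(r"^['\"]?\d{1,3}(\.\d{1,3}){3}")

def merge_rows(inputCsvFile):
    rows = []
    for line in inputCsvFile:
        # Split the line and drop quotes/whitespace around each column
        chunks = [c.strip().strip("'\"") for c in line.strip().split(",")]
        chunks = [c for c in chunks if c]
        if not chunks:
            continue
        if IP_LINE.match(line.strip()):
            # Line starts with an ip address: start a new row
            rows.append(chunks)
        elif rows:
            # No ip address: push these columns up onto the previous row
            rows[-1].extend(chunks)
    return rows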

Upvotes: 0

Views: 2832

Answers (2)

Moon Cheesez

Reputation: 2701

As Ignacio has said, there isn't really a problem if you are using the csv module. If you don't want to use it, use this:

with open("inCSV.txt", "r") as f:
    # Buffer
    b = ""
    keep_reading = False
    # Stream the file line by line instead of reading it all at once
    for line in f:
        line = line.rstrip("\n")
        if "\"" in line:
            # A bunch of tixs are going to appear!
            if b == "":
                # There are more tixs to read
                b += line
                # More tixs to come
                keep_reading = True
            else:
                # This is the last tix to read
                b += line.replace(",", "")
                # Remove newlines, extra whitespace and commas
                b = b.translate(None, " ,\n\"")
                # Add nice looking whitespace
                b = b.replace("E", " E")
                b = b.replace(":", ": ")
                b = b.replace("I", " I")
                b = b.strip()
                # Add comma after IP address
                ip_index = b.find(" ")
                b = b.replace(b[:ip_index + 1], b[:ip_index] + ",")
                # No more tixs to read
                keep_reading = False

                print b
                # reset buffer
                b = ""
        elif keep_reading:
            b += line
        else:
            print line

The advantage of this is, as martineau has said, that you do not need to store the whole file in memory.

If you use the csv module however, you would have to do a bit more manipulation:

import csv
with open("inCSV.txt", "r") as f:
    text = csv.reader(f)
    for line in text:
        # Get associated tix
        tix = line[1]
        # Remove newlines, extra whitespace and commas
        tix = tix.translate(None, " ,\n")
        # Add nice looking whitespace
        tix = tix.replace("E", " E")
        tix = tix.replace(":", ": ")
        tix = tix.strip()

        line[1] = tix
        print line

Both will give you:

['248.53.88.234-24', '']
['61.15.168.199-24', '']
['181.140.27.200', '']
['192.128.254.150', '']
['8.160.137.156', 'ESCCB ID#: 90Z-007463']
['136.107.169.150', '']
['165.246.197.229', 'ESCCB ID#: 90Z-009204 ESCCB ID#: 90Z-003262 ESCCB ID#: 90Z-003011 ESCCB ID#: 90Z-001047']
['155.89.77.11', '']
['91.195.188.160', '']
['154.176.191.130', '']
['105.98.164.205', '']
['245.6.16.92', '']
['207.108.19.66', 'ESCCB ID#: 90Z-002345']
['84.71.75.211', 'ESCCB ID#: 90Z-008567 ESCCB ID#: 90Z-006765 ESCCB ID#: 90Z-009384ESCCB ID#: 90Z-001234ESCCB ID#: 90Z-007465']
['33.236.5.19', '']
['127.42.160.158', 'ESCCB ID#: 90Z-002939']
['94.34.104.184', '']

Upvotes: 1

Ignacio Vazquez-Abrams

Reputation: 798606

csv handles fields with newlines properly as long as they are quoted:

$ cat t.csv
136.107.169.150,
165.246.197.229,"ESCCB ID#: 90Z-009204,
ESCCB ID#: 90Z-003262,
ESCCB ID#: 90Z-003011                   ESCCB ID#: 90Z-001047"
155.89.77.11,
91.195.188.160,
154.176.191.130,

...

>>> import csv
>>> with open('t.csv') as fp:
...   read = csv.reader(fp)
...   for line in read:
...     print line
... 
['136.107.169.150', '']
['165.246.197.229', 'ESCCB ID#: 90Z-009204,\nESCCB ID#: 90Z-003262,\nESCCB ID#: 90Z-003011                   ESCCB ID#: 90Z-001047']
['155.89.77.11', '']
['91.195.188.160', '']
['154.176.191.130', '']

So the problem you think you have, you actually don't. All you need to do is post-process the second field and then write it back out.
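For example, a minimal post-processing pass could look something like this (Python 2 style to match the session above; the t_clean.csv name and the whitespace cleanup are just placeholders for whatever output format you actually want):

import csv

with open('t.csv') as fp, open('t_clean.csv', 'wb') as out:
    read = csv.reader(fp)
    write = csv.writer(out)
    for line in read:
        if len(line) > 1:
            # Collapse the embedded newlines and runs of spaces in the
            # second field into single spaces
            line[1] = ' '.join(line[1].split())
        write.writerow(line)

csv.writer will re-quote the field for you if it still contains a comma or newline, so the output stays valid CSV.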

Upvotes: 2
