Modify some rows in a csv file using python

Question

I have a csv file as follows:

name,row,column,length_of_field
AB000M,8,12,1
AB000M,9,12,1
AB000M,10,0,80
AB000M,10,12,1
AB000M,11,1,1
AB000M,21,0,80
AB000M,22,0,80

What I am trying to do is whenever the column field is 0, add the field length to column and row should be decremented by 1. Also if there is a subsequent line where column is again 0, then add the first field length to the column, decrease row by 1 for the first occurrence and add the field lengths together. And then delete the older lines.

So in the above csv - "AB000M,10,0,80" would become "AB000M,9,80,80" and "AB000M,21,0,80
AB000M,22,0,80" --- these two lines should be replaced by "AB000M,20,80,160".

I am trying to achieve this with this snippet but its not working:

df = pd.read_csv("file.csv")
for ind in df.index:
    if ind >= len(df)-1:
        break
    if df['column'][ind] == 0 and df['column'][ind + 1] != 0:
        df['row'][ind] -=  1
        df['column'][ind] = 80
    elif df['column'][ind] == 0 and df['column'][ind + 1] == 0:
        df['row'][ind] -=  1
        df['column'][ind] = 80
        df['length_of_field'][ind] += df['length_of_field'][ind + 1]
        df.drop([df.index[ind + 1]], axis=0)

BeanBagTheCat · Accepted Answer

This is an example of something that may work for you.

import pandas as pd

df = pd.read_csv('test.csv')
newRows = []
last_val_is_zero = False
tempRow = None
for row in df.iterrows():
    vals = row[1]
    if vals['column'] == 0:
        if not last_val_is_zero:
            vals['row'] = vals['row'] - 1
            vals['column'] = vals['length_of_field']
            tempRow = vals
            last_val_is_zero = True
        else:
            tempRow['length_of_field'] = tempRow['length_of_field'] + vals['length_of_field']
    else:
        if tempRow is not None:
            newRows.append(tempRow)
        newRows.append(vals)
        tempRow = None
        last_val_is_zero = False

if tempRow is not None:
    newRows.append(tempRow)
    
newData = [[val for val in row] for row in newRows]
newDf = pd.DataFrame(newData, columns=[x for x in newRows[0].keys()])

Modify some rows in a csv file using python

Answers (2)

Related Questions