Python DataFrame Copy

Question

I am trying to create a new dataframe based on some criteria based on an original dataframe.

df = pandas.io.sql.read_sql(sql, conn)

Count_Row = df.shape[0]
for j in range(Count_Row - 1):

    if df.iloc[j, 0] == df.iloc[j + 1, 0]:
        print(df.iloc[j, 2] + df.iloc[j + 1, 2], df.iloc[j, 4], df.iloc[j, 6], df.iloc[j, 3])

However instead of printing I want to add that data to a new dataframe.

How is this possible?

rgalbo · Accepted Answer

Instead of printing out the data you can append it to a new data frame

import pandas as pd

df = pandas.io.sql.read_sql(sql, conn)
Count_Row = df.shape[0]

results = pd.DataFrame() # create data frame to store results

for j in range(Count_Row - 1):
    if df.iloc[j, 0] == df.iloc[j + 1, 0]:
        # create row of values to append
        row = pd.Series([df.iloc[j, 2] + df.iloc[j + 1, 2], 
                        df.iloc[j, 4], 
                        df.iloc[j, 6], 
                        df.iloc[j, 3]])
        results = results.append([row])

results.columns = ['v1', 'v2', 'v3', 'v4'] # the variables

This will give you a data frame with the desired output

Python DataFrame Copy

Answers (2)

Related Questions