Python create a data frame by using last n rows

Question

I have a pandas df as follows:

Value1    Value2    Label
15.1      12         0
17        5          1
19        2          1

I am looking to build a new df, such that each row contains thee input of the previous n rows. For example if n=2 my output should be

Value1.1  Value2.1    Value1.2    Value2.2    Value1    Value2   Label
15.1       12          17           5          19        2         1

This is the third row and has a label=1, the Value1 and Value2 of the previous 2 rows are appended to the third row. Any thoughts on how I can achieve this in python? Thanks!

Pierre D · Accepted Answer

Perhaps something like:

n = 2
sel = [k for k in df.columns if k != 'Label']
df2 = df
for k in range(1, n + 1):
    df2 = df2.join(df[sel].shift(k), rsuffix=f'.{k}')

print(df2)
   Value1  Value2  Label  Value1.1  Value2.1  Value1.2  Value2.2
0    15.1      12      0       NaN       NaN       NaN       NaN
1    17.0       5      1      15.1      12.0       NaN       NaN
2    19.0       2      1      17.0       5.0      15.1      12.0

Or, if you prefer the column order you indicated in your example:

df2 = df
for k in range(1, n+1):
    df2 = df[sel].shift(k).join(df2, lsuffix=f'.{k}')

print(df2)
   Value1.2  Value2.2  Value1.1  Value2.1  Value1  Value2  Label
0       NaN       NaN       NaN       NaN    15.1      12      0
1       NaN       NaN      15.1      12.0    17.0       5      1
2      15.1      12.0      17.0       5.0    19.0       2      1

Python create a data frame by using last n rows

Answers (2)

Related Questions