Pandas average of last x rows

Question

I'd like to say, if index is greater than 10, then set all values to an average of the last 3 rows before the index of 10 (9,8,7)

So far I have this;

df.loc[df.index>10,columns_list]=df.loc[df.index<10 & df.index>=7,columns_list].values.mean

jezrael · Accepted Answer

You are close, need parentheses arounf conditions and axis=0 to numpy mean:

np.random.seed(123)
df = pd.DataFrame(np.random.randint(10, size=(15, 3)), columns=list('abc'))

cols = ['a','b']
df.loc[df.index>10, cols] = df.loc[(df.index<10) & (df.index>=7), cols].values.mean(axis=0)

print (df)
      a         b  c
0   2.0  2.000000  6
1   1.0  3.000000  9
2   6.0  1.000000  0
3   1.0  9.000000  0
4   0.0  9.000000  3
5   4.0  0.000000  0
6   4.0  1.000000  7
7   3.0  2.000000  4
8   7.0  2.000000  4
9   8.0  0.000000  7
10  9.0  3.000000  4
11  6.0  1.333333  5
12  6.0  1.333333  1
13  6.0  1.333333  5
14  6.0  1.333333  6

Pandas average of last x rows

Answers (2)

Related Questions