Fill NaN value after grouping twice

Question

Dummy data :

code = ['a','a','a','a','b','b']
serial  =  ['x','y','x','y','x','y']
result = [123,  np.nan, 453, 675, 786, 332]

  code serial  result
0  a    x      123.0 
1  a    y     NaN    
2  a    x      453.0 
3  a    y      675.0 
4  b    x      786.0 
5  b    y      332.0

I want to fill NaN with 675.0, first group by code then by serial and fill the NaN value

Code:

df['result'] = df['result'].fillna(df.groupby('code')['result'].ffill())

In the code above; I want to integrate .groupby('serial')

perl · Accepted Answer

You can groupby both columns at the same time:

df['result'] = df.groupby(['code', 'serial'])['result'].bfill()
df

Output:

  code serial  result
0    a      x   123.0
1    a      y   675.0
2    a      x   453.0
3    a      y   675.0
4    b      x   786.0
5    b      y   332.0

P.S. You would need to bfill instead of ffill it though, since that NaN comes before the first value in the group

Fill NaN value after grouping twice

Answers (2)

Related Questions