Python filling in missing values based on existing data

Question

I have a dataframe containing a one missing value.

   exam_id   exam  
0        1   french   
1        2   italian 
2        3   chinese  
3        4   english  
4        3   chinese  
5        5   russian  
6        1   french       
7      NaN   russian   
8        1   french   
9        2   italian

I want to fill in the missing exam_id for russian exam based on existing information. Since exam_id for russian is 5 I would like to have the same value assigned to the missing one.

akuiper · Accepted Answer

You can group your data frame by exam, then do a ffill + bfill in case there are missing values before and after the existing value:

df.groupby("exam").ffill().bfill()

Python filling in missing values based on existing data

Answers (2)

Related Questions