How to create a new column containing names of columns that are Nan with pandas?

Question

If i've a dataframe like this:

   A     B      C
 Nan   1.0    0.0
 1.0   Nan    1.0
 1.0   0.0    Nan

I want to create a new column in the dataframe that will provide info about which column in each row contains contains nan values.

   A     B      C     Col4

 Nan   1.0    Nan     A,C  
 1.0   Nan    1.0     B
 1.0   Nan    Nan     B,C

Any help?

jezrael · Accepted Answer

Compare by DataFrame.isna and use DataFrame.dot with columns names, last remove last , by Series.str.rstrip:

df['col4'] = df.isna().dot(df.columns + ',').str.rstrip(',')
#if values are strings Nan
#df['col4'] = df.eq('Nan').dot(df.columns + ',').str.rstrip(',')
print (df)
     A    B    C col4
0  NaN  1.0  NaN  A,C
1  1.0  NaN  1.0    B
2  1.0  NaN  NaN  B,C

How to create a new column containing names of columns that are Nan with pandas?

Answers (2)

Related Questions