Getting listed column names of all not nan rows

Question

I have a pandas DataFrame built from a pivot table, with an index and columns. Each index row has non-NaN values in at least one column, while the remaining values are NaN.

          col_1  col_2  col_3  col_4 ...  col_100
index_1     1      2      nan   nan  ...     5 
index_2    nan    nan      1     1   ...     10
...        ...    ...     ...   ...  ...     ...
index_100  nan     9       4    ...  ...     nan

How can I get the column names of all the non-NaN values in each row and put them into lists whose names are automatically suffixed by index? I need to get this:

list_1=[col_1, col_2, col_100]
list_2=[col_3, col_4, col_100]
list_100=[col_2, col_3]

Quang Hoang · Accepted Answer

You can use stack to drop the NaN values and groupby to gather the remaining column names for each index:

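(df.stack()                        # long format; NaN entries are dropped
   .reset_index(level=1)           # column labels move into a 'level_1' column
   .groupby(level=0, sort=False)   # group by the original index, keeping its order
   ['level_1'].apply(list)         # collect the column names per index
)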

Output:

index_1      [col_1, col_2, col_100]
index_2      [col_3, col_4, col_100]
index_100             [col_2, col_3]
Name: level_1, dtype: object
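
For a self-contained way to try this, here is a minimal sketch on a small stand-in frame (the four columns and their values are made up for illustration, not the asker's data). It uses a notna() mask instead of stack, so it does not depend on stack dropping NaN, and it stores the lists in a dict keyed by index rather than in separately named variables, which is an assumption about what is convenient for the asker:

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the pivot table in the question (four columns only).
df = pd.DataFrame(
    {
        "col_1": [1, np.nan, np.nan],
        "col_2": [2, np.nan, 9],
        "col_3": [np.nan, 1, 4],
        "col_100": [5, 10, np.nan],
    },
    index=["index_1", "index_2", "index_100"],
)

# Boolean mask: True where a cell holds a non-NaN value.
mask = df.notna()

# For each row, keep the column labels where the mask is True.
# A dict keyed by index label stands in for list_1, list_2, ...
cols_by_index = {
    idx: df.columns[row.to_numpy()].tolist()
    for idx, row in mask.iterrows()
}

print(cols_by_index["index_1"])
```

A lookup such as cols_by_index["index_100"] then plays the role of list_100, without creating variable names dynamically.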
