Adding dataframe columns together, separated by commas, considering NaNs

Question

How can NaN values be omitted completely from the new column, so that the result does not contain consecutive commas?

df['newcolumn'] = df.apply(''.join, axis=1)

One approach would probably be to use a conditional lambda:

df.apply(lambda x: ','.join(x.astype(str)) if(np.isnan(x.astype(str))) else '', axis = 1)

But this returns an error message:

TypeError: ("ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''", 'occurred at index 0')
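The error most likely comes from calling np.isnan on the result of x.astype(str): the row then holds Python strings (NaN becomes the text 'nan'), and np.isnan only accepts numeric input. A minimal sketch of a row-wise variant that drops the missing values before joining is shown here. The frame is rebuilt from the sample data shown further down, and the names row and joined are only illustrative.

```python
import numpy as np
import pandas as pd

# Sample frame rebuilt from the data shown in the accepted answer.
df = pd.DataFrame({
    "Mary": ["a", "a", "a", "a", "a", "b"],
    "John": ["t", "t", "u", "u", "u", "t"],
    "David": ["y", np.nan, "y", "n", np.nan, "y"],
})

# For each row: drop the NaN entries first, then join what is left.
# astype(str) is applied after dropna(), so NaN never becomes the text 'nan'.
joined = df.apply(lambda row: ",".join(row.dropna().astype(str)), axis=1)

print(joined)
```

Because dropna() runs per row, no condition inside the lambda is needed.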

Edit: Both of your answers work. What criteria should I use to decide which one to use? Performance considerations?
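One way to compare candidates on performance is to time them on a larger frame with timeit. The sketch below is only an assumption about how such a comparison could be set up: with_stack follows the stack approach from the accepted answer, with_apply is a hypothetical row-wise alternative (not necessarily the other answer referred to above), and the frame size and repeat count are arbitrary.

```python
import timeit

import numpy as np
import pandas as pd

base = pd.DataFrame({
    "Mary": ["a", "a", "a", "a", "a", "b"],
    "John": ["t", "t", "u", "u", "u", "t"],
    "David": ["y", np.nan, "y", "n", np.nan, "y"],
})

# Repeat the sample rows to get a frame big enough to time (size is arbitrary).
big = pd.concat([base] * 500, ignore_index=True)


def with_stack(frame):
    # Reshape to one long Series, drop NaN, then join per original row.
    return frame.stack().dropna().groupby(level=0).apply(",".join)


def with_apply(frame):
    # Row-wise alternative: drop NaN in each row, then join.
    return frame.apply(lambda row: ",".join(row.dropna()), axis=1)


# Check that both candidates agree before timing them.
same_result = with_stack(big).tolist() == with_apply(big).tolist()

t_stack = timeit.timeit(lambda: with_stack(big), number=3)
t_apply = timeit.timeit(lambda: with_apply(big), number=3)
print(f"stack: {t_stack:.3f}s  apply: {t_apply:.3f}s  same result: {same_result}")
```

The timings depend on the pandas version, the frame shape, and the share of NaN values, so it is worth measuring on data that resembles the real use case. Readability and how rows consisting only of NaN should be handled are reasonable criteria as well.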

BENY · Accepted Answer

You can use stack, since it removes NaN values by default:

df.stack().groupby(level=0).apply(','.join)
Out[552]: 
0    a,t,y
1      a,t
2    a,u,y
3    a,u,n
4      a,u
5    b,t,y
dtype: object

Data input

df
Out[553]: 
  Mary John David
0    a    t     y
1    a    t   NaN
2    a    u     y
3    a    u     n
4    a    u   NaN
5    b    t     y
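For anyone who wants to reproduce this end to end, here is a minimal self-contained sketch that rebuilds the data input and assigns the joined strings to a new column. The explicit dropna() is an addition that is not part of the answer: it is a no-op when stack already drops NaN, and it keeps the result the same on pandas versions where stack may keep NaN values from the input. The column name newcolumn is taken from the question.

```python
import numpy as np
import pandas as pd

# Rebuild the data input shown above.
df = pd.DataFrame({
    "Mary": ["a", "a", "a", "a", "a", "b"],
    "John": ["t", "t", "u", "u", "u", "t"],
    "David": ["y", np.nan, "y", "n", np.nan, "y"],
})

# stack() reshapes the frame into one long Series indexed by (row, column).
# dropna() is added defensively in case this pandas version keeps NaN.
stacked = df.stack().dropna()

# Group by the original row label (index level 0) and join each group.
result = stacked.groupby(level=0).apply(",".join)

# Assignment aligns on the index, so each row gets its own joined string.
df["newcolumn"] = result
print(df)
```

Note that a row containing only NaN would not appear in result at all, so it would end up as NaN in newcolumn; chaining fillna('') on the new column would turn that into an empty string if that is preferred.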
