python column replacement and fillnna

Question

I have following dataframe:

df:
A   B_x B_y C_x C_y
R1  0   3   6   7
R2  NAN 4   8   9
R3  2   5   NAN 2

I am looking to replace NAN values from columns with _x with corresponding columns of _y.

I cannot use absolute names like B_x and B_y since the column names are dynamically derived from previous code and i have no control over it.

Right now i am using the following:

ens_prefix is a variable which holds values of column along with _x
ens_prefix_1 is a variable which holds values of column along with _y

    df[ens_prefix].fillna(df[ens_prefix_1], inplace=True)   # replace values of NAN from _x column with _y
    df = df.filter(regex=r'.*(?



Expected output:

    A    B    C
0  R1  0.0  6.0
1  R2  4.0  8.0
2  R3  2.0  2.0

jezrael · Accepted Answer

First I prefer use replace in columns names instead strip, because strip should aslo remove all last x, y values not only after _, but also before _.

Solution useDataFrame.fillna with replaced _x to empty strings with selected last _y and removed columns by _y in last step:

df = (df.rename(columns = lambda x: x.replace('_x', ''))
        .fillna(df.filter(regex='_y$')
                  .rename(columns = lambda x: x.replace('_y', '')))
        .filter(regex=r'.*(?

python column replacement and fillnna

Answers (2)

Related Questions