Reputation: 87
I have a dataframe with 5000 records. I want the null values to be filled with:
Average(all the preceding values before the null, the first succeeding value after the null)
data:
Date gcs Comp Clay WTS
2020-01-01 1550 41 9.41 22.6
2020-01-02 1540 48 9.50 25.8
2020-01-03 NaN NaN NaN NaN
2020-01-04 1542 42 9.30 23.7
2020-01-05 1580 48 9.10 21.2
2020-01-06 NaN NaN NaN NaN
2020-01-07 1520 40 10 20.2
2020-01-08 1523 30 25 19
Example: For the date 2020-01-03, I want the null value in the gcs column to be filled with Average(1550, 1540, 1542), which gives 1544.
1550 and 1540 are the preceding values before the null, and 1542 is the first succeeding value after it.
Similarly,
For the date 2020-01-06, I want the null value in the gcs column to be filled with Average(1550, 1540, 1544, 1542, 1580, 1520), which gives 1546.
1550 through 1580 are the preceding values before the null (including the imputed 1544), and 1520 is the first succeeding value after it.
Desired Output:
Date gcs Comp Clay WTS
2020-01-01 1550 41 9.41 22.6
2020-01-02 1540 48 9.50 25.8
2020-01-03 1544 43.66 9.403 24.03
2020-01-04 1542 42 9.30 23.7
2020-01-05 1580 48 9.10 21.2
2020-01-06 1546 43.77 9.45 22.92
2020-01-07 1520 40 10 20.2
2020-01-08 1523 30 25 19
Edit:
Thanks for the response, Tom. I kept my Date column as the index and tried the code below:
def foo(row):
    if any(row.isna()):
        df.loc[row.name, row.isna()] = df.expanding().mean().shift(-1).loc[row.name, :]

df.apply(foo, axis=1)
The output that I got is:
Date
2020-01-01 None
2020-01-02 None
2020-01-03 None
2020-01-04 None
2020-01-05 None
2020-01-06 None
2020-01-07 None
2020-01-08 None
dtype: object
Can you please help me figure out what is wrong?
Upvotes: 3
Views: 784
Reputation: 8790
The following seems to work. You define an apply function for the rows which modifies the df in place. Each time a row with null values is reached, you take an expanding mean of df (see here), using a shift to include the following row. You then use loc to overwrite df with the new values:
def foo(row):
    if any(row.isna()):
        df.loc[row.name, row.isna()] = df.expanding().mean().shift(-1).loc[row.name, :]
Applying:
>>> df.apply(foo, axis=1)
gcs Comp Clay WTS
Date
2020-01-01 1550.0 41.000000 9.410000 22.600000
2020-01-02 1540.0 48.000000 9.500000 25.800000
2020-01-03 1544.0 43.666667 9.403333 24.033333
2020-01-04 1542.0 42.000000 9.300000 23.700000
2020-01-05 1580.0 48.000000 9.100000 21.200000
2020-01-06 1546.0 43.777778 9.452222 22.922222
2020-01-07 1520.0 40.000000 10.000000 20.200000
2020-01-08 1523.0 30.000000 25.000000 19.000000
Note that I moved your Date column to be an index. I think the above should work wherever the missing values are, ensuring that the values are filled in from top to bottom.
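Putting it all together, a self-contained version might look like this (the DataFrame construction is my own reconstruction of the sample data from the question):

```python
import numpy as np
import pandas as pd

# Reconstruction of the sample data, with Date as the index
df = pd.DataFrame(
    {
        "gcs":  [1550, 1540, np.nan, 1542, 1580, np.nan, 1520, 1523],
        "Comp": [41, 48, np.nan, 42, 48, np.nan, 40, 30],
        "Clay": [9.41, 9.50, np.nan, 9.30, 9.10, np.nan, 10, 25],
        "WTS":  [22.6, 25.8, np.nan, 23.7, 21.2, np.nan, 20.2, 19],
    },
    index=pd.date_range("2020-01-01", periods=8, name="Date"),
)

def foo(row):
    # For rows containing NaNs, fill them from the column-wise expanding
    # mean, shifted up one row so the first value *after* the gap is included.
    if any(row.isna()):
        df.loc[row.name, row.isna()] = df.expanding().mean().shift(-1).loc[row.name, :]

df.apply(foo, axis=1)
print(df)
```

Note that foo returns nothing, so depending on your pandas version the return value of df.apply(foo, axis=1) may display as a Series of None values (as in your edit) rather than the filled frame; the filling happens on df itself, so inspect df after the apply.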
I'm not sure how it will handle scaling up to 5000 rows, but it seems like you have to use apply or some loop because you want to include imputed values in the calculation of future imputed values*. I added the if statement because it seemed to speed up the calculation considerably:
%%timeit
df.apply(foo, axis=1)
#1.17 ms ± 25.5 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
%%timeit
df.apply(foo_without_if, axis=1)
#16.2 ms ± 201 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
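(foo_without_if isn't shown above; presumably it is just foo with the guard removed, i.e. something like the sketch below, using a single-column reconstruction of the sample data:)

```python
import numpy as np
import pandas as pd

# Single-column reconstruction of the sample data
df = pd.DataFrame(
    {"gcs": [1550, 1540, np.nan, 1542, 1580, np.nan, 1520, 1523]},
    index=pd.date_range("2020-01-01", periods=8, name="Date"),
)

def foo_without_if(row):
    # Same as foo, but recomputes the expanding mean for every row,
    # even rows with no missing values (hence the slowdown). For rows
    # without NaNs, row.isna() selects no columns and the assignment
    # is a no-op.
    df.loc[row.name, row.isna()] = df.expanding().mean().shift(-1).loc[row.name, :]

df.apply(foo_without_if, axis=1)
```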
*If you don't want to do this (i.e. you can just take the expanding mean but ignore NAs from earlier rows), you can do:
mask = df.isna()
df[mask] = df.expanding().mean()[mask.shift(1)].shift(-1)
Upvotes: 1