Pandas dynamic Groupby and Shift

Question

I am attempting to perform a dynamic shift within a groupby object. In this case my grouping is Account and each account will have the column Valuation shifted by minus the number of rows specified in the column Shift. There was a similar question a while ago but that involved a cumsum, where as here I just want the value. See dynamic shift with groupby on dataframe. If possible I'd like to avoid an apply for performance reasons as I have 10s of millions of rows.

import pandas as pd
import numpy as np

    df = pd.DataFrame({
        'Account': [1000001, 1000001, 1000001, 1000001, 1000001, 1000001, 1000001,
                    1000001, 1000001, 1000001, 1000002, 1000002, 1000002, 1000002,
                    1000002, 1000002, 1000002, 1000002, 1000002],
        'Date': ['Jan-18', 'Feb-18', 'Mar-18', 'Apr-18', 'May-18', 'Jun-18',
                 'Jul-18', 'Aug-18', 'Sep-18', 'Oct-18', 'Jan-18', 'Feb-18',
                 'Mar-18', 'Apr-18', 'May-18', 'Jun-18', 'Jul-18', 'Aug-18',
                 'Sep-18'],
        'Valuation':[ 50000,  51000,  52020,  53060,  54122,  55204,  56308,  57434,
                     58583,  59755, 100000, 102000, 104040, 106121, 108243, 110408,
                     112616, 114869, 117166],
        'Shift': [3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 2, 2, 2, 2, 2, 2, 2, 2, 2]       })

The desired dataframe looks like this:

moys · Accepted Answer

check this out.

def sh(x):
    s = df.loc[x.index, 'Shift']
    return (x.shift(-s.iloc[0]))
df['Valuation_shifted']= (df.groupby('Account')['Valuation'].apply(sh))

I know you said you did not want to do apply. But in this case, we are not doing lambda apply. Rather, we are doing a function that finds out the first value of the column 'Shift' in each group & shifts 'Valuation_shifted' by that much.

Pandas dynamic Groupby and Shift

Answers (2)

Related Questions