Pandas sequentially apply function using output of previous value

Question

I want to compute the "carryover" of a series: for each row, multiply the value by a decay factor and add the result to the value already computed for the previous row. The first row is taken as-is.

How do I do this in pandas?

import numpy as np
import pandas as pd

decay = 0.5
test = pd.DataFrame(np.random.randint(1, 10, 12), columns=['val'])
test
    val
0   4
1   5
2   7
3   9
4   1
5   1
6   8
7   7
8   3
9   9
10  7
11  2

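decayed = []
for i, v in test.iterrows():
    if i == 0:
        # the first row has no previous value, so it is kept undecayed
        decayed.append(v.val)
        continue
    # previous carryover plus the decayed current value
    d = decayed[i-1] + v.val*decay
    decayed.append(d)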

test['loop_decay'] = decayed
test.head()

    val loop_decay
0   4   4.0
1   5   6.5
2   7   10.0
3   9   14.5
4   1   15.0

Parfait · Accepted Answer

Consider a vectorized version with cumsum(): take the cumulative sum of (val * decay) and add the very first val to it.

Because cumsum() already includes the first (val * decay), you then need to subtract that term once so the first row equals the undecayed val:

# .ix was removed in pandas 1.0; .loc is the label-based equivalent here
test['loop_decay'] = test.loc[0, 'val'] + (test['val'] * decay).cumsum() - test.loc[0, 'val'] * decay
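
As a quick sanity check, here is a minimal sketch comparing the cumsum() expression with the loop from the question. It assumes the sample values shown in the question (hard-coded instead of random draws), and the names `first` and `vectorized` are illustrative, not from the original post.

```python
import numpy as np
import pandas as pd

decay = 0.5
# Sample values copied from the question's printed frame (an assumption,
# since the original used random draws)
test = pd.DataFrame({'val': [4, 5, 7, 9, 1, 1, 8, 7, 3, 9, 7, 2]})

# Reference result: the explicit loop, starting from the undecayed first value
decayed = [float(test['val'].iloc[0])]
for v in test['val'].iloc[1:]:
    decayed.append(decayed[-1] + v * decay)

# Vectorized result: first val + running sum of decayed values,
# minus the first decayed term that cumsum() already counted
first = test['val'].iloc[0]
vectorized = first + (test['val'] * decay).cumsum() - first * decay

# Both approaches should agree on every row
assert np.allclose(vectorized, decayed)
print(vectorized.head())
```

The cumsum() shortcut applies here because each step only adds a term that does not depend on the previous result, so the recurrence is a plain running sum.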
