Cumsum column while skipping rows or setting fixed values on a condition based on the result of the actual cumsum

Question

I'm looking for a vectorized pandas solution to something that is quite common in spreadsheets: a cumsum that skips rows or substitutes fixed values depending on the result of the running cumsum itself. My DataFrame has a single column 'A', whose values are shown in the expected result below.

I need to add a second column 'B' holding the cumsum of 'A', with two adjustments. If a running sum comes out positive, replace it with 0 and continue the cumsum from that 0. If a running sum comes out negative and lower than the lowest value recorded in column A since the last 0 in column B, replace it with that lowest value. I know this is an awkward problem, but is there a vectorized solution for it, perhaps using an auxiliary column? The result should look like this:

    A   B
1   0   0
2  -1  -1  # -1+0 = -1
3   2   0  # -1 + 2 = 1 but  1>0 so this is 0
4   3   0  # same as previous row
5  -2  -2  # -2+0 = -2
6  -3  -3  # -2-3 = -5 but the lowest value in column A since last 0 is -3 so this is replaced by -3
7   1  -2  #  1-3 = -2
8  -1  -3  # -1-2 = -3
9   1  -2  # -3 + 1 = -2
10 -2  -3  # -2-2 = -4 but the lowest value in column A since last 0 is -3 so this is replaced by -3 
11  1  -2  # -3 +1 = -2
12  2   0  # -2+2 = 0
13 -1  -1  # 0-1 = -1
14 -2  -2  # -1-2 = -3 but the lowest value in column A since the last 0 is -2 so this is replaced by -2

For the moment I have this, but it does not work 100% and it is not really efficient either:

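# assumes df already holds column 'A'
df['B'] = 0  # first row of B stays 0
b_col = df.columns.get_loc('B')
for x in range(len(df) - 1):
    A = df['A'].iat[x + 1]
    B = df['B'].iat[x] + A  # previous B plus current A
    if B >= 0:
        df.iat[x + 1, b_col] = 0  # cap at 0
    elif A < 0 and B < A:
        df.iat[x + 1, b_col] = A  # floor at the current A
    else:
        df.iat[x + 1, b_col] = B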