Python: Cumulative Sum with changing key

Question

I have a table of data such as:

F(1)  F(2)  F(3)  Amount
A     B     C     100
A     B     C     100
A     B     C     100
D     E     F     300
D     E     F     150
G     H     I     100
G     H     I     200

I would like to produce a new column, showing the cumulative sum of field 'Amount', but one that resets to 0 whenever the key of columns F(1), F(2) and F(3) change.

i.e. I would like to create the following output (sans dotted lines!)

F(1)  F(2)  F(3)  Amount  CumSum
A     B     C     100     100
A     B     C     100     200
A     B     C     100     300
------------------------------ resets to zero as key changes
D     E     F     300     300
D     E     F     150     450
------------------------------ resets to zero as key changes
G     H     I     100     100
G     H     I     200     300

I have potentially up to a million rows in this table so I am looking for a robust implementation. Is pandas the way forward here? I have not used pandas before but am happy to explore.

Python: Cumulative Sum with changing key

Answers (1)

Related Questions