How to subtract the minimum from a column in a pandas DataFrame

Question

I'd like to find the elapsed time since the first time an event was observed. For this I saved each observation in a CSV file. Each event is identified by a unique hash.

Right now I'm doing the following:

from pandas import *
from bz2 import BZ2File
events = DataFrame.from_csv(BZ2File('events.csv.bz2', 'r'), sep='	', header=0, index_col=None)
m = events.groupby('hash')['timestamp'].min()

at this point I have a Series indexed by the hash and the timestamp of the first observation. How would I use this to get the time offset for each row in the events DataFrame (simply timestamp - min(timestamp))?

How to subtract the minimum from a column in a pandas DataFrame

Answers (1)

Related Questions