Group by and add new column with min value between dates - pandas

Question

I have this Pandas dataframe:

I want a new DF to group them by ['ticked_id','time_a'] and add a new column with the min difference in time (hh), SQL code that works:

SELECT ticket_id, DATEDIFF('hh', time_a, MIN(time_b)) each_diff from ...

I've tried to group them but it results on an object that I can't see

Daniel Wyatt · Accepted Answer

To group the data and get a column with the minimum date of the time_b column you can do:

df_grouped = df.groupby(['ticket_id', 'time_a'])['time_b'].min().reset_index()

I don't know the datatypes of your time_a and time_b columns but if they are timestamps you can then do the following to get the difference in hours:

df_grouped['each_diff'] = (df_grouped['time_b'] - df_grouped['time_a').astype('timedelta64[h]')

Group by and add new column with min value between dates - pandas

Answers (2)

Related Questions