Groupby a column value for 2 hour

Question

I have a data frame with columns Date, Time, Distance. I want to groupby distance column for each day with 2 hours interval. I have checked the question asked in stack overflow, but it is not giving me the required output.

The sample code I have pasted is from stack overflow, but the output is not as desired.

uniquedate = np.unique(df2["Date"])
for i in uniquedate:
    data_new1 = df2[df2['Date']==i]
    data_new2 = data_new1[['Real_time', 'Distance_km']].copy()
    data_new2.index = pd.to_timedelta(data_new2.index.astype(str))
    data_new2 = data_new2.resample('2H').sum()
    print(data_new2)

                 Distance_km
00:00:00.000000   276.850844
                 Distance_km
00:00:00.000000   193.115217

I got this output but this is not as required

user12029183 · Accepted Answer

Try this:

data_new2.groupby([data_new2.index,pd.TimeGrouper(freq='2H')]).sum()

Please avoid copy pasting code from stack overflow. More often than not the solution will not suffice for your use case. Plus it's lazy.

EDIT Make your index as datetime or timestamp aand then doing a simple resample should work for you.

import pandas as pd
idx = pd.date_range('2018-01-01', periods=5, freq='H')
ts = pd.Series(range(len(idx)), index=idx)
ts = ts.resample('2H').sum()

Groupby a column value for 2 hour

Answers (2)

Related Questions