Is it possible to do a groupby on the result of a groupby?

Question

I don't think I need to share the entire dataframe, but basically, this is the line of code in question (with pandas already imported, of course)

divstack = df[df['Competitor']=='Emma Slabach'].groupby(['Division','Stack'])['Time'].min()

The output is :

>>> divstack
Division  Stack 
6U F      3/3/03     2.66
          3/6/03     4.81
          Cycle     13.89
7-8 F     3/3/03     2.41
          3/6/03     2.68
          Cycle      7.71
9-10 F    3/3/03     2.13
          3/6/03     2.75
          Cycle      6.94
Name: Time, dtype: float64

I already grabbed Emma's fastest time is 2.13, thanks to this line of code:

emma = df[df['Competitor']=='Emma Slabach'].groupby(['Competitor'])['Time'].min()

and the output is:

>>> emma
Competitor
Emma Slabach    2.13 
Name: Time, dtype: float64

But how would I go about modifying the first line of code from earlier to specifically obtain the Division and Stack (along with Time) of when her fastest time occurred? (Division 9-10F and Stack 3/3/03).

I don't think a function is necessary, but is there a way I can perform another groupby on top of that first groupby output (divstack) I got, to further "minimize" and get her fastest time? Or could I input emma somewhere in divstack to obtain which division/stack that time occurs?

I need to store the division, stack, and time into divstack

andrew_reece · Accepted Answer

Given divstack, you can retrieve the full MultiIndex entry with .loc and min():

divstack.loc[divstack.eq(divstack.min())]

Division  Stack 
9-10 F    3/3/03    2.13
Name: Time, dtype: float64

Is it possible to do a groupby on the result of a groupby?

Answers (2)

Related Questions