How to groupby column and return a dataFrame instead of groupby object

Question

I have a dataFrame that looks as such:

Date        Yearly_cost
2009-01-01  230
2010-03-03  260
2009-01-01  320
2007-03-02  430

The same dataFrame contains multiple duplicate values for Date but different values for Yearly_cost. I want to groupby Date so that I have a consistent time series with all corresponding values for each day. However I want it to return a df rather than a groupby object.

The desired result would look as such:

Date Yearly_cost 2007-03-02 430 2009-01-01 230, 320 2010-03-03 260

Any help would be appreciated

U13-Forward · Accepted Answer

To answer the revised question, use:

df.groupby('Date')['Yearly_cost'].apply(list).reset_index(name='Yearly_cost')

If you want to change e.g. [320] to 320, do:

df.groupby('Date')['Yearly_cost'].apply(list).apply(lambda x: x[0] if len(x) == 1 else x).reset_index(name='Yearly_cost')

How to groupby column and return a dataFrame instead of groupby object

Answers (2)

Related Questions