Aggregating rows and creating columns as count of values

Question

I have a dataframe that looks something like:

   tt  oo
0  g  gh
1  g  jj
2  g  gh
3  t  gh
4  t  gh

I'd like to end up with a new dataframe that aggregates on 'tt', giving counts of the 'oo' column so that it looks like:

   gh  jj
g  2   1
t  2   0

I tried a pivot table but ended up with an 'Index contains duplicate entries' error.

ely · Accepted Answer

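import pandas

dfrm1 = pandas.DataFrame({'tt':['g', 'g', 'g', 't', 't'], 
                          'oo':['gh', 'jj', 'gh', 'gh', 'gh']})

# Count each 'oo' value within every 'tt' group, then move the 'oo' values
# into columns; fill_value=0 fills missing combinations and keeps integer counts.
dfrm1.groupby('tt')['oo'].value_counts().unstack(level=1, fill_value=0)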
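
For comparison, the same table of counts can probably also be built with pandas.crosstab, which tabulates how often each pair of values occurs. This is a minimal sketch and not part of the original answer; the names df and counts are placeholders chosen for illustration.

```python
import pandas as pd

# Same sample data as in the question.
df = pd.DataFrame({'tt': ['g', 'g', 'g', 't', 't'],
                   'oo': ['gh', 'jj', 'gh', 'gh', 'gh']})

# Rows come from 'tt', columns from 'oo'; each cell counts how many
# rows have that (tt, oo) pair. Pairs that never occur are counted as 0.
counts = pd.crosstab(df['tt'], df['oo'])
print(counts)
```

Because crosstab counts occurrences rather than reshaping existing values, duplicate (tt, oo) pairs are expected input and should not trigger the duplicate-entries error mentioned in the question.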
