Group by column value and set it as index in Pandas

Question

I have a dataframe df1 that looks like this:

df1 = pd.DataFrame({'A':[0,5,4,8,9,0,7,6],
                   'B':['a','s','d','f','g','h','j','k'],
                   'C':['XX','XX','XX','YY','YY','WW','ZZ','ZZ']})

My goal is to group the elements according to the values contained in column Cso that rows having the same value, have the same index (which must contain the value stored in C). Therefore the output should be like this:

I tried to use the command df.groupby('C') but it returns the following object:

Can you suggest me an elegant and smart way to achieve my goal?

Note: I think my question is somehow related to multi-indexing

jezrael · Accepted Answer

It seems you need DataFrame.set_index

df2 = df1.set_index('C')
print (df2)
    A  B
C       
XX  0  a
XX  5  s
XX  4  d
YY  8  f
YY  9  g
WW  0  h
ZZ  7  j
ZZ  6  k

print (df2.loc['XX'])
    A  B
C       
XX  0  a
XX  5  s
XX  4  d

If need MultiIndex from columns C and A:

df3 = df1.set_index(['C', 'A'])
print (df3)
      B
C  A   
XX 0  a
   5  s
   4  d
YY 8  f
   9  g
WW 0  h
ZZ 7  j
   6  k

print (df3.loc['XX'])
   B
A   
0  a
5  s
4  d

Group by column value and set it as index in Pandas

Answers (2)

Related Questions