Dataframe use grouped row values as a row name

Question

i just started to work with python (pandas) and now i have my first question. I have a dataframe with the following row names:

ID  A    Class
1  True  [0,5]
2  False [0,5]
3  True  [5,10]
4  False [10,20]
5  True  [0,5]
6  False [10,20]

Now i'm looking for a cool solution, where i can do something like this:

Class  True   False
[0,5]   2      1
[5,10]  1      0
[10,20] 0      2

I want to count how much True and False i have for a Class Is there a fast solution? My Dataframe could have more than 2 million entries.

Fabio Lamanna · Accepted Answer

Let df be your dataframe, I would first use:

g = df.groupby('Class')['A'].value_counts().reset_index()

that returns:

     Class      A  0
0    [0,5]   True  2
1    [0,5]  False  1
2  [10,20]  False  2
3   [5,10]   True  1

then I would pivot the above table to get your desired shape:

a = pd.pivot_table(g, index='Class', columns='A', values=0).fillna(0)

This returns:

A        False  True 
Class                
[0,5]      1.0    2.0
[10,20]    2.0    0.0
[5,10]     0.0    1.0

Answers (2)