Python 3.6: Creating a pivot table summarizing counts of values for multiple columns in dataframe

Question

I have the following data frame:

import pandas as pd

df = pd.DataFrame({'X': ['Agree', 'Disagree', 'Agree', 'Neutral', 'Agree', 'Neutral'],
                   'Y': ['Disagree', 'Neutral', 'Agree', 'Disagree', 'Agree', 'Neutral'],
                   'Z': ['Agree', 'Neutral', 'Neutral', 'Disagree', 'Neutral', 'Neutral']})

I want to create a table that counts how many 'Agree', 'Neutral' and 'Disagree' responses there are in each category (column) X, Y and Z.

The output should look like this:

df_answer = pd.DataFrame({'Response': ['Agree', 'Neutral', 'Disagree'],
               'X': [3,2,1],
               'Y': [2,2,2], 
               'Z': [1,4,1]})

I tried to find an answer to this but can't seem to find one that addresses this case in particular.

I would prefer 'Response' to be a regular column with a separate index, but it's also okay if 'Response' is the index, if that makes it easier.

ansev · Accepted Answer

We can use DataFrame.apply + pd.value_counts:

new_df = df.apply(pd.value_counts)
print(new_df)

          X  Y  Z
Agree     3  2  1
Disagree  1  2  1
Neutral   2  2  4
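To get from this table to the exact layout asked for in the question (a 'Response' column, with rows ordered Agree, Neutral, Disagree), something like the following sketch may help. It is not part of the original answer. It uses pd.Series.value_counts rather than the top-level pd.value_counts, since the top-level function is deprecated in recent pandas releases (2.1 and later). The names order, counts and df_answer are chosen here for illustration only.

```python
import pandas as pd

df = pd.DataFrame({
    'X': ['Agree', 'Disagree', 'Agree', 'Neutral', 'Agree', 'Neutral'],
    'Y': ['Disagree', 'Neutral', 'Agree', 'Disagree', 'Agree', 'Neutral'],
    'Z': ['Agree', 'Neutral', 'Neutral', 'Disagree', 'Neutral', 'Neutral'],
})

# Row order requested in the question (assumed to be the full set of responses).
order = ['Agree', 'Neutral', 'Disagree']

counts = (
    df.apply(pd.Series.value_counts)  # count each response within each column
      .reindex(order)                 # reorder rows to match the question
      .fillna(0)                      # a response absent from a column counts as 0
      .astype(int)                    # fillna can leave floats; restore integers
)

# Turn the index into an ordinary 'Response' column with a fresh integer index.
df_answer = counts.rename_axis('Response').reset_index()
print(df_answer)
```

The fillna(0) step does nothing for this particular data, because every response occurs in every column, but it guards against NaN if a response were missing from one of them.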

We can also do:

df2 = df.melt()
new_df = pd.crosstab(df2['value'], df2['variable'])
print(new_df)

variable  X  Y  Z
value            
Agree     3  2  1
Disagree  1  2  1
Neutral   2  2  4
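As a rough illustration of what the melt step does: it reshapes the frame to long format, one row per (column, response) pair, which is the shape pd.crosstab tabulates. The sketch below is an addition, not part of the original answer. It names the melted columns through var_name and value_name so the final table carries a 'Response' label; 'Category', long_df, table and df_answer are names assumed here for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    'X': ['Agree', 'Disagree', 'Agree', 'Neutral', 'Agree', 'Neutral'],
    'Y': ['Disagree', 'Neutral', 'Agree', 'Disagree', 'Agree', 'Neutral'],
    'Z': ['Agree', 'Neutral', 'Neutral', 'Disagree', 'Neutral', 'Neutral'],
})

# Long format: 6 rows x 3 columns become 18 rows of (Category, Response).
long_df = df.melt(var_name='Category', value_name='Response')

# Cross-tabulate responses (rows) against the original column names (columns).
table = pd.crosstab(long_df['Response'], long_df['Category'])

# Reorder rows as in the question and drop the 'Category' label on the columns.
table = table.reindex(['Agree', 'Neutral', 'Disagree'])
table.columns.name = None

# Move 'Response' out of the index into an ordinary column.
df_answer = table.reset_index()
print(df_answer)
```

Unlike the value_counts approach, crosstab fills combinations that never occur with 0 on its own, so no fillna step is needed for values that appear somewhere in the data.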
