Making a pivot table across multiple columns of non-numeric data

Question

The following code generates a dummy dataframe:

import pandas as pd

df = pd.DataFrame(
    {
        'user_id': [1,2,3,1,1],
        'account_type': ['google','facebook','apple','facebook','google'],
        'activated': ['y','pending','n','y','y']
    
    }
)

df.head()

What I need is to create a pivot table for unique_values in the account_type column, which aggregates by the user_id column.

Essentially, for each user_id, I want to see how many of each types of accounts have a value other than n under the activated column.

The resulting dataframe should be:

I'm beat by this so far because I the pivot_table function seems to only be able to work with numeric values.

Anurag Dabas · Accepted Answer

Try Via groupby(), where() and transform() method:

df['count']=(df.groupby(['user_id','account_type'])['activated']
           .transform(lambda x:x.where(df['activated'].ne('n')).count()))

Finally use pivot_table() and rename_axis() method:

result=df.pivot_table(index='user_id',columns='account_type',values='count',fill_value=0).rename_axis(columns=None)

Output of result:

          apple     facebook    google
user_id             
1           0       1           2
2           0       1           0
3           0       0           0

Making a pivot table across multiple columns of non-numeric data

Answers (2)

Related Questions