How do you group, sort, and limit in Python Pandas? (i.e. Get Top 10)

Question

I have a Pandas dataframe that has columns actor_id and account_id. Actor is a person and account is simply an account. So a person can have more than one account and accounts can have multiple people.

My goal is to group by actor_id and then rank the actor_ids by the number of accounts they have so that I can get a list of the Top 10 actors with the most accounts.

In SQL, it would be something like SELECT actor_id, account_id, COUNT(account_id) GROUP BY actor_id LIMIT 10. But I am Trying to do this in Python.

I referenced this Pandas group and sort by index count but it did not work for me. Below is the code I've tried.

df['count'] = df['actor_id'].map(df['account_id'].value_counts())
df.sort_index('count', ascending=False)

The dataset looks like:

In the picture, replace project_id with account_id.

How do you group, sort, and limit in Python Pandas? (i.e. Get Top 10)

Answers (1)

Related Questions