using 'groupby.count' with agg

Question

df.head

                Populous        Continents
Australia   2.331602e+07        Australia
Brazil      2.059153e+08        South America
Canada      3.523986e+07        North America
China      1.367645e+09         Asia
France     6.383735e+07         Europe

Above are the first 5 entries of my dataframe. I want to group them by Continents, then I want to perform some statistical analysis. I want to create a new dataframe with the Avg, Sum, STD of each Group's populous as well as the count of countries in each group, as its columns.

new_df =df.groupby('Continents')['Populous'].agg({ 'Avg': np.average, 'Sum':np.sum, 'STD': np.std}), takes care of three columns, but I don't know how to get count in there. I tried including 'Size': count , within the agg method, but it resulted in an error.

Thank you.

pansen · Accepted Answer

You can use 'Size': len or 'Size': 'count' for this to work. However, as @DSM pointed out, len does count missing values whereas 'count' doesn't.

using 'groupby.count' with agg

Answers (2)

Related Questions

using &#39;groupby.count&#39; with agg

Answers (2)

Related Questions

using 'groupby.count' with agg