Finding the maximum entry based on another column in a data frame

Question

Suppose I have a data frame with 3 columns: A, B, C. I want to group by column A, and find the row (for each unique A) with the maximum entry in C, so that I can store that row.A, row.B, row.C into a dictionary elsewhere.

What's the best way to do this without using iterrows?

John Zwinck · Accepted Answer

# generate sample data
import numpy as np
import pandas as pd
df = pd.DataFrame(np.random.randint(0,10,(10,3)))
df.columns = ['A','B','C']

# sort by C, group by A, take last row of each group
df.sort_values('C').groupby('A').nth(-1)
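
The question also asks about storing each selected row in a dictionary without iterrows. A minimal sketch of one way to do that follows, using groupby with idxmax instead of sorting. The names rng, best and result, the fixed seed, and the {A: (B, C)} layout of the dictionary are assumptions made for illustration, not part of the original answer.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data with the same shape as in the answer (seed chosen arbitrarily)
rng = np.random.default_rng(0)
df = pd.DataFrame(rng.integers(0, 10, (10, 3)), columns=["A", "B", "C"])

# For each group in A, idxmax gives the index label of the row with the largest C;
# .loc then pulls those full rows out of the original frame
best = df.loc[df.groupby("A")["C"].idxmax()]

# Build {A: (B, C)} from plain tuples, with no iterrows involved
result = {a: (b, c) for a, b, c in best.itertuples(index=False, name=None)}
```

When several rows in a group tie for the maximum C, idxmax keeps the first one it encounters, which may not be the same row the sort-based approach selects.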
