Pandas arrange rows based on duplicate values of selected column

Question

I have a dataframe that looks like -

ID  NAME    AGE
1   ABC     10
2   XYZ     12
3   PQR     20
4   ABC     25
5   XYZ     30
6   PQR     20
7   KLM     22
8   NOP     16

I would like to re-arrange and keep only the following rows based on the duplicate values on NAME column -

ID  NAME    AGE
1   ABC     10
2   ABC     25
3   XYZ     12
4   XYZ     30
5   PQR     20
6   PQR     20

with following rows removed -

7   KLM     22
8   NOP     16

Thanks!

wwnde · Accepted Answer

Please try

df[df.duplicated(subset=['NAME'], keep=False)].sort_values(by='NAME')

Answers (2)