Replace duplicates in a group with the majority vote in Pandas

Question

I have this Pandas dataframe

image	user	answer
img_01	1	1
img_01	2	0
img_01	2	1
img_01	2	0
img_01	3	1
img_01	4	1
img_02	1	1
img_02	...	...

As you can see, user 2 gave 3 answers in total for img_01, but not always the same. This happens throughout the dataset with different images and users. I know I can acquire the (image/user) combinations of these duplicates with

g = dataset.groupby('image')['user'].value_counts()
g = g[g > 1]

Now I want to replace his 3 answers with the majority vote among his answers or drop him entirely. How can I do that?

Replace duplicates in a group with the majority vote in Pandas

Answers (1)

Related Questions