Find Duplicate rows from df. Python

Question

df = 

Name    Age City
Jack    34  Sydney
Riti    30  Delhi
Aadi    16  New York
Riti    30  Delhi
Riti    30  Delhi
Riti    30  Mumbai
Aadi    40  London
Sachin  30  Delhi

df[df.duplicated(keep='last')]

The above code gives the list of duplicated. But what I need is if the df contains atleast 1 duplicate, then it should return The df contains duplicate rows.

Sayandip Dutta · Accepted Answer

You can use any:

>>> df
     Name  Age     City
0    Jack   34   Sydney
1    Riti   30    Delhi
2    Aadi   16  NewYork
3    Riti   30    Delhi
4    Riti   30    Delhi
5    Riti   30   Mumbai
6    Aadi   40   London
7  Sachin   30    Delhi
>>> df.duplicated().any()
True
>>> 'The df contains duplicates' if df.duplicated().any() else 'no duplicates' 
'The df contains duplicates'

Find Duplicate rows from df. Python

Answers (2)

Related Questions