How to keep ONLY duplicated values in Pandas Dataframe?

Question

I need to drop the rows whose 'county' value is unique, since a county that appears only once has no value in my machine learning process.

However, the code below is not removing the rows with unique 'county' values; they are still in my dataset. Help!

# counting unique values
n = len(pd.unique(data['county']))
print("No.of.unique values :", n)


data[data.groupby('county')['county'].transform('size') > 1]
data

state	county
AL	Barbour County
AL	Barbour County
WY	Sweetwater County

I also tried

data = data[data.duplicated(subset=['county'], keep=False)]

no luck.


Answers (1)
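
A minimal sketch, assuming the goal is to keep only the rows whose 'county' appears more than once. The groupby line in the question builds a filtered DataFrame but never assigns it, so `data` is left unchanged when it is displayed afterwards. The `duplicated(..., keep=False)` attempt does assign the result and should give the same filter, so if it still appears to fail, it may be worth checking for stray whitespace or case differences in 'county' (an assumption, since the full dataset is not shown). The sample frame and the names `dupes_by_size` and `dupes_by_flag` are hypothetical, made up for illustration.

```python
import pandas as pd

# Hypothetical sample data mirroring the columns shown in the question.
data = pd.DataFrame({
    "state": ["AL", "AL", "WY"],
    "county": ["Barbour County", "Barbour County", "Sweetwater County"],
})

# Boolean indexing returns a new DataFrame; it must be assigned to be kept.
dupes_by_size = data[data.groupby("county")["county"].transform("size") > 1]

# Same filter: keep=False flags every row of a duplicated group, not just the repeats.
dupes_by_flag = data[data.duplicated(subset=["county"], keep=False)]

print(dupes_by_size)
```

Assigning the result back (`data = data[...]`) replaces the original frame; using a new name, as above, keeps the unfiltered data available for comparison.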
