Maximus
Maximus

Reputation: 89

How to remove outliers from a dataset using bivariate boxplot

I have a data-set (See below) that is made up of multiple variables, two of these are 'manu' and 'popul' and they both contain numeric values.

enter image description here

From this data I plotted a bivariate boxplot using 'manu' and 'popul' so that I could find outliers between these two variables. This is what it looks like:

enter image description here

Then from this plot I can see there are a few outliers, I was able to identify what values are outliers using the code below, I will also show which values are outliers:

enter image description here

enter image description here

What I would like to know is how do I now take the row that contains these values and remove them from the dataset?

Thanks in advance.

Upvotes: 0

Views: 237

Answers (1)

the-mad-statter
the-mad-statter

Reputation: 8961

You can extract the desired subset like so:

enter image description here

Upvotes: 1

Related Questions