Are outliers used to calculate quantiles in box plots in ggplot2?

Question

As stated in the title. I've browsed through a couple of articles, and they're quite vague on this subject. Are all values used when computing the quantiles in a box plot (Q1, Q2, Q3), or only the ones in the "data range" (that is, the ones within 1.5 times the interquartile range of Q1 or Q3)?

I'm creating my boxplots using the ggplot2 package. I write:

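library(tidyverse)  # dplyr (%>%, filter), forcats (fct_reorder), ggplot2

fulldata %>%
  # keep only the rows with status "påbörjat studier" (started studies)
  filter(status == "påbörjat studier") %>%
  # order the groups by their median PERC_CREDIT
  ggplot(aes(x = fct_reorder(urvalsgrupp, PERC_CREDIT, .fun = median), y = PERC_CREDIT)) +
  geom_boxplot() +
  coord_flip()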

And I get:

Now, as you can see, there are two outliers in the HP group. Were these outliers used when calculating the quantiles, or would the box (the quantiles) sit further to the left if these values were taken into account?
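One way to check this empirically is sketched below. It is only a sketch on made-up data: the `toy` data frame and its values are hypothetical and not taken from `fulldata`. It compares the box statistics that ggplot2 actually draws (read back with `layer_data()`) against quartiles computed from all values, and against quartiles computed after dropping the points ggplot2 flagged as outliers.

```r
library(ggplot2)

# Hypothetical toy data: one group with two extreme low values
toy <- data.frame(
  grp = "HP",
  val = c(1, 2, 50, 55, 60, 62, 65, 68, 70, 72, 75, 80)
)

p <- ggplot(toy, aes(x = grp, y = val)) + geom_boxplot()

# Box statistics computed by ggplot2 for the plotted layer
stats <- layer_data(p)
drawn <- stats[, c("lower", "middle", "upper")]

# Quartiles computed from every value, outliers included
all_values <- quantile(toy$val, c(0.25, 0.5, 0.75))

# Quartiles computed after dropping the points ggplot2 flagged as outliers
flagged <- unlist(stats$outliers)
without_outliers <- quantile(toy$val[!toy$val %in% flagged], c(0.25, 0.5, 0.75))

print(drawn)
print(all_values)
print(without_outliers)
```

If `drawn` matches `all_values`, the outliers were included when the box was computed; if it matches `without_outliers` instead, they were excluded first.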

