How to add significance bars to a bar plot of percentage/logical data

Question

I have a dataframe with a categorical variable and a few logical variables. I’ve created an example using the iris dataset:

library(dplyr)
library(tidyr)

# Create a new data frame showing the percentage of each variable that is considered long.
# Numeric variables are first converted to logical depending on whether they exceed a threshold;
# the percentage of TRUE values is then calculated per species.

long_data <- iris %>%
  mutate(
    Long.Sepal.Length = Sepal.Length > 6,
    Long.Sepal.Width = Sepal.Width > 3,
    Long.Petal.Length = Petal.Length > 2.5,
    Long.Petal.Width = Petal.Width > 0.3
  ) %>%
  group_by(Species) %>%
  summarise(
    Long.Sepal.Length = mean(Long.Sepal.Length) * 100,
    Long.Sepal.Width = mean(Long.Sepal.Width) * 100,
    Long.Petal.Length = mean(Long.Petal.Length) * 100,
    Long.Petal.Width = mean(Long.Petal.Width) * 100
  ) %>%
  pivot_longer(
    cols = starts_with("Long"),
    names_to = "Measurement",
    values_to = "Percentage_True"
  )

> long_data
# A tibble: 12 × 3
   Species    Measurement       Percentage_True
   <fct>      <chr>                       <dbl>
 1 setosa     Long.Sepal.Length               0
 2 setosa     Long.Sepal.Width               84
 3 setosa     Long.Petal.Length               0
 4 setosa     Long.Petal.Width               18
 5 versicolor Long.Sepal.Length              40
 6 versicolor Long.Sepal.Width               16
 7 versicolor Long.Petal.Length             100
 8 versicolor Long.Petal.Width              100
 9 virginica  Long.Sepal.Length              82
10 virginica  Long.Sepal.Width               34
11 virginica  Long.Petal.Length             100
12 virginica  Long.Petal.Width              100

I want to create a bar plot to compare the percentage of TRUE values for each logical variable by species. Here’s how I plan to create the plot:

library(ggplot2)

# Create the bar plot (geom_col() is the idiomatic form of geom_bar(stat = "identity"))
ggplot(long_data, aes(x = Measurement, y = Percentage_True, fill = Species)) +
  geom_col(position = "dodge") +
  labs(
    x = "Measurement",
    y = "% True",
    fill = "Species"
  )

[Image: plot I want to add significance bars to]

How can I add significance bars above each group of bars?

I’ve tried using ggpubr functions, but I’m not sure how to make them work. I envision the plot looking something like the final graph on this page: https://rpkgs.datanovia.com/ggpubr/reference/geom_pwc.html. However, I cannot get it to work with my logical dataset, because the plot is built from the long-format summary rather than the original data.

[Image: example of what I want my significance bars to look like]
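One possible approach, sketched here under several assumptions, is to run the tests on the per-observation logical values (keeping the counts that the percentage summary throws away) and then draw the brackets by hand. Fisher's exact test and the Holm adjustment are choices made for this sketch, not something the question specifies, and the brackets are drawn with plain ggplot2 layers rather than ggpubr. The names raw_long, counts, tests, n_true, xmin, xmax and sig_plot are all hypothetical, and the dodge offsets assume three species with the default dodge width of 0.9.

```r
library(dplyr)
library(tidyr)
library(ggplot2)

# Keep one logical value per flower (thresholds copied from the question)
raw_long <- iris %>%
  mutate(
    Long.Sepal.Length = Sepal.Length > 6,
    Long.Sepal.Width  = Sepal.Width > 3,
    Long.Petal.Length = Petal.Length > 2.5,
    Long.Petal.Width  = Petal.Width > 0.3
  ) %>%
  select(Species, starts_with("Long")) %>%
  pivot_longer(-Species, names_to = "Measurement", values_to = "Is_Long")

# Counts of TRUE and group sizes; a test of proportions needs both
counts <- raw_long %>%
  group_by(Measurement, Species) %>%
  summarise(n_true = sum(Is_Long), n = n(), .groups = "drop") %>%
  mutate(Percentage_True = 100 * n_true / n)

# Every pair of species within every measurement
measurements <- sort(unique(counts$Measurement))
pairs <- t(combn(levels(iris$Species), 2))
tests <- expand.grid(
  Measurement = measurements,
  pair_id = seq_len(nrow(pairs)),
  stringsAsFactors = FALSE
)
tests$group1 <- pairs[tests$pair_id, 1]
tests$group2 <- pairs[tests$pair_id, 2]

# Assumption: Fisher's exact test on the 2 x 2 table of TRUE/FALSE counts
tests$p <- mapply(function(m, g1, g2) {
  d <- counts[counts$Measurement == m & counts$Species %in% c(g1, g2), ]
  fisher.test(cbind(d$n_true, d$n - d$n_true))$p.value
}, tests$Measurement, tests$group1, tests$group2, USE.NAMES = FALSE)

# Assumption: Holm correction across all 12 comparisons
tests$p_adj <- p.adjust(tests$p, method = "holm")
tests$label <- as.character(cut(
  tests$p_adj,
  breaks = c(0, 0.001, 0.01, 0.05, 1),
  labels = c("***", "**", "*", "ns"),
  include.lowest = TRUE
))

# Bracket coordinates: bar centres sit at x - 0.3, x, x + 0.3 for a dodge width of 0.9
offsets <- c(setosa = -0.3, versicolor = 0, virginica = 0.3)
x_index <- match(tests$Measurement, measurements)
tests$xmin <- x_index + offsets[tests$group1]
tests$xmax <- x_index + offsets[tests$group2]
tests$y <- 104 + 7 * (tests$pair_id - 1)  # stack the three brackets above 100 %

sig_plot <- ggplot(counts, aes(x = Measurement, y = Percentage_True)) +
  geom_col(aes(fill = Species), position = position_dodge(width = 0.9)) +
  geom_segment(
    data = tests,
    aes(x = xmin, xend = xmax, y = y, yend = y),
    inherit.aes = FALSE
  ) +
  geom_text(
    data = tests,
    aes(x = (xmin + xmax) / 2, y = y, label = label),
    vjust = -0.3,
    inherit.aes = FALSE
  ) +
  labs(x = "Measurement", y = "% True", fill = "Species")

sig_plot
```

Because the bracket table is an ordinary data frame, the test, the correction method, or the label cut-offs can be swapped without touching the plotting code. The fixed bracket heights work here only because percentages cannot exceed 100.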
