How can I use a two sample t-test when there are two groups in R?

Question

I have a data frame with the categories fruits, ripeness, and mean. How can I create a for loop that runs a ttest to determine the mean difference for the ripeness for EACH fruit? In other words, for apples, the ttest would produce a result of the mean difference between ripe and unripe apples. An example of this would look like the following table.

Joshua Mire · Accepted Answer

Something like this could work for returning p-values of the t-test comparing "Ripeness" as you loop through the unique "Fruits" that appear in your data.

## create a vector of the unique fruit in the data; vector of fruit to be tested
fruit<-unique(data$Fruits)
## iterate through your list of unique fruit, testing as you go
for(i in 1:length(fruit)){
  ## subset your data to include only the current fruit to be tested
  df<-filter(data, Fruits==fruit[i])
  ## let the user know which fruit is being tested
  message(fruit[i])
  ## create a vector of the unique ripeness states of the current fruit to be tested
  ripe<-unique(df$Ripeness)
  ## make sure two means exist; ensure there are both ripe and non-ripe values
  if(length(ripe) < 2){
    ## if only one ripeness, let user know and skip to next unique fruit
    message("only one ripeness")
    next
  }
  ## try testing the fruit and return p-value if success
  tryCatch(
    {
      message(t.test(Mean ~ Ripeness, data = df)$p.value)
    },
    ## if error in t-testing return message that there are "not enough observations"
    error=function(cond) {
      message("not enough observations")
    }
  )    
}

I hope this helps!

How can I use a two sample t-test when there are two groups in R?

Answers (2)

Related Questions