Calculate Mean in R on multiple columns for values greater X

Question

I have the following data and would like to calculate the mean values for t1-t5, grouped by iid, for every cid.

I only want to calculate the mean for values > 0
Ideally I don't want name all the fields when doing mean e.g. mean(t1), mean(t2). This is because in my real case I have 200+ fields.

Sample data:

cid   iid   t1  t2  t3  t4  t5
4503    2   4   5   5   6   7
4503    2   7   5   3   2   8
841     2   9   2   1   5   7
2134    1   6   9   8   2   1

Here is my code so far. Can somebody please help me to finish it off. Thanks in advance.

library(dplyr)
test <- read.csv("~/Documents/R-SCRIPTS/DATA/test.csv", sep=";")

t <- test %>% 
  group_by(cid, iid) %>%
  select(t1:t5) %>%
  summarise(t1 = mean(t1, na.rm = TRUE), 
            t2 = mean(t2,na.rm = TRUE), 
            t3 = mean(t3,na.rm = TRUE), 
            t4 = mean(t4,na.rm = TRUE), 
            t5 = mean(t5,na.rm = TRUE) 
            )

talat · Accepted Answer

If I understand correctly, you could simply use:

test %>% 
  group_by(cid, iid) %>% 
  summarise_each(funs(mean(.[.>0], na.rm = TRUE)), t1:t5)
#Source: local data frame [3 x 7]
#Groups: cid [?]
#
#    cid   iid    t1    t2    t3    t4    t5
#  (int) (int) (dbl) (dbl) (dbl) (dbl) (dbl)
#1   841     2   9.0     2     1     5   7.0
#2  2134     1   6.0     9     8     2   1.0
#3  4503     2   5.5     5     4     4   7.5

Calculate Mean in R on multiple columns for values greater X

Answers (2)

Related Questions