Sum a group of columns by row count

Question

I'm trying to create a new dataset from an existing one. Each row of the new dataset should combine 60 rows of the original, so that per-second event counts become per-minute totals. The number of columns will generally not be known in advance.

For example, with this dataset, if we split it into groups of 3 rows:

We'll get this data.frame. Row 1 contains the column sums for rows 1-3 of d1 and Row 2 contains the column sums for rows 4-6 of d1:

I've tried `d2 <- colSums(d1[seq(1, NROW(d1), 3), ])`, which is about as close as I've been able to get.

I've also considered recommendations from "How to sum rows based on multiple conditions - R?", "How to select every xth row from table", "Remove last N rows in data frame with the arbitrary number of rows", "sum two columns in R", and "Merging multiple rows into single row". I'm all out of ideas. Any help would be greatly appreciated.

Rich Pauloo · Accepted Answer

Create a grouping variable, `group_by` that variable, then `summarise_all`.

# your data
d <- data.frame(a = c(1,0,0,0,0,1),
                b = c(1,1,1,0,0,0),
                c = c(0,0,0,1,1,1),
                d = c(1,1,0,0,0,0))

# create the grouping variable 
d$group <- rep(c("A","B"), each = 3)

# sum every column within each group
# (funs() is deprecated in current dplyr; passing the function directly works)
library(dplyr)
d %>%
  group_by(group) %>%
  summarise_all(sum)

Returns:

# A tibble: 2 x 5
  group     a     b     c     d
  <chr> <dbl> <dbl> <dbl> <dbl>
1 A         1     3     0     2
2 B         1     0     3     0
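
The grouping variable above is hard-coded for six rows. For data of arbitrary length (60 rows per group in the question), the group index can be derived from the row number instead. Here is a minimal sketch of that idea. It uses base R's `rowsum()` rather than dplyr so that it runs without extra packages, and `sum_by_chunk` is a hypothetical helper name, not something from the original answer:

```r
# Hypothetical helper (an assumption for illustration):
# sum every `n` consecutive rows of an all-numeric data frame.
sum_by_chunk <- function(df, n) {
  # Rows 1..n get group 1, rows (n+1)..2n get group 2, and so on.
  # If nrow(df) is not a multiple of n, the last group is simply shorter.
  group <- ceiling(seq_len(nrow(df)) / n)
  # rowsum() adds up each column within each group, whatever the column count
  rowsum(df, group)
}

# same example data as above
d <- data.frame(a = c(1, 0, 0, 0, 0, 1),
                b = c(1, 1, 1, 0, 0, 0),
                c = c(0, 0, 0, 1, 1, 1),
                d = c(1, 1, 0, 0, 0, 0))

d2 <- sum_by_chunk(d, 3)   # use n = 60 for seconds-to-minutes
print(d2)
```

The same `group` vector could also be assigned to `d$group` and passed through the `group_by` pipeline shown earlier, if staying within dplyr is preferred.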
