Collapse a data frame to a unique row

Question

I'm trying to collapse my data frame so that each unique date has a single row holding all of its associated variables. Here is a sample of my data as it stands (FYI: the full data set has around 300 observations across different dates):

date <- c("10/30/17", "10/30/17", "10/30/17", "10/30/17")
eventcode <- c("14", "14", "14", "14")
eoi145 <- c(1, 0, 0, 0)
eoi140 <- c(0, 1, 0, 0)
eoi141 <- c(0, 0, 0, 1)
eoi143 <- c(0, 0, 1, 0)
df <- data.frame(date, eventcode, eoi145, eoi140, eoi141, eoi143)
View(df)

I want to get into this format:

date <- c("10/30/17")
eventcode <- c("14")
eoi145 <- c(1)
eoi140 <- c(1)
eoi141 <- c(1)
eoi143 <- c(1)
df <- data.frame(date, eventcode, eoi145, eoi140, eoi141, eoi143)

I've tried using cast, melt, and reshape. Can anyone give me a hint as to which packages or techniques would accomplish this?

Thanks!

Tony Breyal · Accepted Answer

One approach from the dplyr package:

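library(dplyr)

# Sum each indicator column within every date/eventcode pair.
# across() needs dplyr >= 1.0.0 and replaces the deprecated funs().
reduced_df <- df %>%
  group_by(date, eventcode) %>%
  summarise(across(everything(), ~ as.integer(sum(.x)))) %>%
  ungroup()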

With output:

# A tibble: 1 x 6
#  date     eventcode eoi145 eoi140 eoi141 eoi143
#  10/30/17 14             1      1      1      1
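
For comparison, the same collapse can probably be done without any extra packages. The following is a minimal base R sketch, not part of the original answer, that rebuilds the sample data from the question and uses aggregate() with a formula. The names collapsed and collapsed_flags are made up for illustration.

```r
# Sample data from the question
df <- data.frame(
  date      = c("10/30/17", "10/30/17", "10/30/17", "10/30/17"),
  eventcode = c("14", "14", "14", "14"),
  eoi145    = c(1, 0, 0, 0),
  eoi140    = c(0, 1, 0, 0),
  eoi141    = c(0, 0, 0, 1),
  eoi143    = c(0, 0, 1, 0)
)

# "." on the left-hand side means "every column not named on the right",
# so each eoi* column is summed within each date/eventcode pair.
collapsed <- aggregate(. ~ date + eventcode, data = df, FUN = sum)

# Variant: max() keeps each flag at 0/1 even when the same flag
# is set on more than one row of a group.
collapsed_flags <- aggregate(. ~ date + eventcode, data = df, FUN = max)

print(collapsed)
```

Note that the formula interface of aggregate() drops rows containing NA by default (na.action = na.omit), so data with missing values may need to be handled before collapsing.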
