Collapse Column in R

Question

I have searched as best I could, and part of my problem is that I'm really not sure exactly what to ask. Here's my data, and how I want it to end up:

Now:

john    a Yes
john    b No
john    c No
Rebekah a Yes
Rebekah d No
Chase   c Yes
Chase   d No
Chase   e No
Chase   f No

How I'd like it to be:

john     a,b,c    Yes
Rebekah  a,d      Yes
Chase    c,d,e,f  Yes

Notice that the 3rd column says Yes when it is the first row with that particular value in the 1st column. The 3rd column isn't necessary; I was only using it because I thought I would try to do this all with if and for statements, but that seemed very inefficient. Is there any way to make this work efficiently?

Veerendra Gadekar · Accepted Answer

Another option would be (using data mentioned by @bgoldst)
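That data is not reproduced here. A minimal sketch of an equivalent data frame, assuming the column names a, b and c used by the code in this answer (the names are an assumption, not taken from the question):

```r
# Hypothetical reconstruction of the question's data; the column names
# a, b and c are assumed from the code in this answer.
df <- data.frame(
  a = c("john", "john", "john", "Rebekah", "Rebekah",
        "Chase", "Chase", "Chase", "Chase"),
  b = c("a", "b", "c", "a", "d", "c", "d", "e", "f"),
  c = c("Yes", "No", "No", "Yes", "No", "Yes", "No", "No", "No"),
  stringsAsFactors = FALSE  # keep the columns as character, not factor
)
```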

library(dplyr)

# Group by name, then collapse each group's unique b values into one string
out = df %>%
      group_by(a) %>%
      summarize(b = paste(unique(b), collapse = ","), c = "yes")

#> out
#Source: local data frame [3 x 3]

#        a       b   c
#1   Chase c,d,e,f yes
#2 Rebekah     a,d yes
#3    john   a,b,c yes

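Using data.table:

library(data.table)

# setDT() converts df to a data.table by reference; by = .(a) groups by name
out = setDT(df)[, .(b = paste(unique(b), collapse = ","), c = "yes"), by = .(a)]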

#> out
#         a       b   c
#1:    john   a,b,c yes
#2: Rebekah     a,d yes
#3:   Chase c,d,e,f yes
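
If neither package is available, the same grouping can probably be done in base R as well. The following is only a sketch using aggregate(), not part of the original answer; the data frame is a hypothetical reconstruction of the question's data with assumed column names a and b.

```r
# Hypothetical reconstruction of the question's first two columns
df <- data.frame(
  a = rep(c("john", "Rebekah", "Chase"), times = c(3, 2, 4)),
  b = c("a", "b", "c", "a", "d", "c", "d", "e", "f"),
  stringsAsFactors = FALSE
)

# aggregate() splits b by the values of a and applies the function to
# each group, returning one row per distinct value of a
out <- aggregate(b ~ a, data = df,
                 FUN = function(x) paste(unique(x), collapse = ","))

# The third column is constant in the desired result, so add it afterwards
out$c <- "Yes"
```

Note that aggregate() returns the groups sorted by a, so the row order may differ from the order in which the names first appear in the data.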
