Formatting a data.frame with binary values

Question

I have a data frame with 4 columns and 4 rows. For simplicity, I converted the values to numeric. It looks like this:

df <- structure(list(a = c(1,2,2,0),
                     b = c(2,1,0,2),
                     c = c(2,2,1,1),
                     d = c(0,2,0,1)),row.names=c(NA,-4L) ,class = "data.frame")

I would like to reshape this data frame so that, for each row, it lists the column names holding the value 1 and the column names holding the value 2:

   1     2
1  a     b/c
2  b     a/c/d
3  c     a
4  c/d   b

Is there a function or package I should look into? I have been doing lots of text processing in R recently. I'd appreciate your assistance!

thelatemail · Accepted Answer

tapply fun with some row and col indexes (stealing df from Ronak's answer):

tapply(
  colnames(df)[col(df)],
  list(row(df), unlist(df)),
  FUN=paste, collapse="/"
)[,-1]

#  1     2      
#1 "a"   "b/c"  
#2 "b"   "a/c/d"
#3 "c"   "a"    
#4 "c/d" "b"

Basically, I'm taking one long vector holding the column name of every cell in df and tabulating it by the combination of each cell's row and its original value. The final [,-1] drops the first column of the result, which collects the names whose value is 0.
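To make the intermediate pieces visible, here is a minimal sketch of the same idea split into steps. The helper names (cell_name, cell_row, cell_value, long, res, out) are made up for illustration, and the df values are an assumption: they were chosen so that the result matches the matrix printed above.

```r
# Assumed example data: values picked to reproduce the printed result.
df <- data.frame(
  a = c(1, 2, 2, 0),
  b = c(2, 1, 0, 2),
  c = c(2, 2, 1, 1),
  d = c(0, 2, 0, 1)
)

# One entry per cell, in column-major order (all of a, then all of b, ...).
cell_name  <- colnames(df)[col(df)]  # column name of each cell
cell_row   <- as.vector(row(df))     # row number of each cell
cell_value <- unname(unlist(df))     # value stored in each cell

# Long format: one line per cell of the original data frame.
long <- data.frame(name = cell_name, row = cell_row, value = cell_value)

# Group by (row, value) and paste the names within each group.
# Combinations that never occur come back as NA.
res <- tapply(long$name, list(long$row, long$value), paste, collapse = "/")

# The first column belongs to value 0; drop it to keep only 1 and 2.
out <- res[, -1]
out
```

Looking at long first can help when the grouping does not behave as expected, since every (row, value) pair that tapply sees is listed there explicitly.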
