Recode and codense variables

Question

I'm working on the output off an online questionnaire and have some trouble handling the data. This is the setups: 200 images have been rated on two 9-point-scales, totaling in 400 combinations. Unfortunately, the data hasn't been in encoded in 400 variables with values ranging from 1 to 9, but for each scale-image combination, 9 binary variables have been encoded, looking like this for two image-scale combinations:

Part. V1 V2 V3 V4 V5 V6 V7 V8 V9 V10 V11 V12 V13 V14 V15 V16 V17 V18
1     0  0  1  0  0  0  0  0  0  0   1   0   0   0   0   0   0   0
2     0  0  0  0  0  0  1  0  0
3                                0   0   1   0   0   0   0   0   0

As you can see, there are also some N/A values in the data set. That's because of all 400 combinations, each participant only rated a randomised 50. Given the 400 combinations, we have a total of 3600 variables in the data set. I would now like to condense and recode those values in a sense, that R counts the vars in intervals of 9, then recodes the binary 1 for a value of 1 to 9, depending on its position on the scale, and then condenses everything into 400 combination variables. In the end, it should look something like this:

Part. C1 C2
1     3  2 
2     7 
3        3

I've looked into the reshape package, but couldn't exactly figure out the way to do this.

Any suggestions?

zx8754 · Accepted Answer

Using apply family functions:

#dummy data
df <- read.table(text = "
Part.,V1,V2,V3,V4,V5,V6,V7,V8,V9,V10,V11,V12,V13,V14,V15,V16,V17,V18
1,0,0,1,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0
2,0,0,0,0,0,0,1,0,0,,,,,,,,,
3,,,,,,,,,,0,0,1,0,0,0,0,0,0
", header = TRUE, sep = ",")

# result

# cbind - column bind, put columns side by side
cbind(
  # First column is the "Part." column
  df[, "Part.", drop = FALSE],
  # other columns are coming from below code
  # sapply returns matrix, converting it to data.frame so we can use cbind.
      as.data.frame(
        # get data column index 9 columns each, first 2 to 9, then 10 to 18, etc.
        sapply(seq(2, ncol(df), 9), function(i)
          # for each 9 columns check at which position it is equal to 1,
          # using which() function
          apply(df[, i:(i + 8)], 1, function(j) which(j == 1)))
      )
)

#output
#   Part. V1 V2
# 1     1  3  2
# 2     2  7   
# 3     3     3

Recode and codense variables

Answers (2)

Related Questions