R - Extract highest value in a vector and replace using that value

Question

I have a very large data frame structured like this:

User          A          B          C          D
1             1,0        0,0        0,2        1,1
2             1,1,1      0,0,1      0,0,0      2,2,1
3             1          0          0          1
4             2,1,2      1,1,1      3,2,1      1,1,1

I need to take the highest value out of each vector/cell, and replace that respective cell with the highest value. In other words, the data frame should look like this:

User          A          B          C          D
1             1          0          2          1
2             1          1          0          2
3             1          0          0          1
4             2          1          3          1

To create the first data frame, I have used aggregate based on User number. I thought about trying lapply with gsub, but that would require me to decide the specific combinations possible a priori, and that could lead to too many errors given the size of this data set.

Any help is very much appreciated!

G5W · Accepted Answer

Just split each entry and take max.

## create data 
df = read.table(text="User          A          B          C          D
1             '1,0'        '0,0'        '0,2'        '1,1'
2             '1,1,1'      '0,0,1'      '0,0,0'      '2,2,1'
3             '1'          '0'          '0'          '1'
4             '2,1,2'      '1,1,1'      '3,2,1'      '1,1,1'",
header=TRUE, stringsAsFactors=FALSE)

for(i in 2:5) {
    df[,i] = sapply(strsplit(df[,i], ","), function(x) max(as.numeric(x))) }
df
  User A B C D
1    1 1 0 2 1
2    2 1 1 0 2
3    3 1 0 0 1
4    4 2 1 3 1

R - Extract highest value in a vector and replace using that value

Answers (2)

data

Related Questions