Create column with (for each row) maximum value in a subset based on values in that row

Question

I have a data frame df with three columns: two input columns, input1 and input2, and an output column, output.

I want to add a new column that holds, for each row, the maximum of output over all rows of df whose input1 and input2 are both less than or equal to that row's input1 and input2.

I managed to do that easily with a for loop:

output <- 1:10
input1 <- c(5, 5, 10, 10, 7, 7, 20, 9, 12, 18)
input2 <- c(8, 6, 16, 16, 8, 20, 21, 12, 30, 21)

df <- data.frame(output, input1, input2)

# For each row i, take the maximum of output over all rows whose
# input1 and input2 are both <= the values in row i.
for (i in seq_len(nrow(df))) {
  df[i, "max"] <- max(df$output[df$input1 <= df$input1[i] &
                                df$input2 <= df$input2[i]])
}

However, this is not feasible for my original data, which has up to 1,000,000 observations, because every iteration scans the whole data frame.

Is there any option with apply or within data.table to speed up this process?

Answers (1)
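
One possible approach, sketched here rather than benchmarked, is a non-equi self-join in data.table: join the table to itself on input1 <= input1 and input2 <= input2 and aggregate once per row with by = .EACHI. The name dt and the use of the question's toy data are assumptions for illustration, and the data.table package must be installed. How well this scales to 1,000,000 rows depends on how many rows match each condition, so it is worth timing on a sample first.

```r
library(data.table)

# Toy data from the question.
dt <- data.table(
  output = 1:10,
  input1 = c(5, 5, 10, 10, 7, 7, 20, 9, 12, 18),
  input2 = c(8, 6, 16, 16, 8, 20, 21, 12, 30, 21)
)

# Non-equi self-join: for every row of the inner `dt` (the i argument),
# match all rows of the outer `dt` whose input1 and input2 are both
# <= that row's values, then take max(output) over those matches.
# by = .EACHI evaluates j once per row of i, in the original row order.
res <- dt[dt,
          on = .(input1 <= input1, input2 <= input2),
          max(output),
          by = .EACHI]

# Each row matches at least itself, so there is exactly one result per row.
dt[, max := res$V1]

print(dt)
```

On the toy data this should give the same max column as the for loop in the question.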