Product of columns, by row, with dynamic column input - Vectorize operation

Question

I would like to vectorize the following code in for more efficient processing. I need to take the product of columns by row (i.e. rowProds), but the number of columns I would like the product of needs to be a function of another input.

If possible I would prefer this be done using Base R, but I'm open to and appreciate any suggestions.

This can easily be done using a loop or apply family with a udf, but these are not fast enough to meet my needs.

# Generate some data

mat <- data.frame(X = 1:5)
for (i in 1:5) {
  set.seed(i)
  mat[1 + i] <- runif(5)
}

# Via a for loop

for (i in 1:nrow(mat)) {  
  mat$calc[i] <- prod(mat[match(mat$X[i], mat$X), 2:(i + 1)])
}
mat

# Via a function with mapply

rowprodfun <- function(X) {  
  myprod <- prod(mat[match(X, mat$X), 2:(X + 1)])
  return(myprod)
}

mat$calc <- mapply(rowprodfun, mat$X)
mat

mat$calc
# [1] 0.265508663 0.261370165 0.126427355 0.013874517 0.009758232

Both methods above result in the same "calc" column. I just need a more efficient way to generate this column.

akrun · Accepted Answer

One option would be to convert the upper triangle elements as NA and then use rowProds from matrixStats

library(matrixStats)
rowProds(as.matrix(mat[-1] * NA^upper.tri(mat[-1])), na.rm = TRUE)
#[1] 0.265508663 0.261370165 0.126427355 0.013874517 0.009758232

Product of columns, by row, with dynamic column input - Vectorize operation

Answers (2)

Related Questions