How to use lapply to find closest value in a list in R?

Question

I'm trying to find, for each row of a large data frame, the candidate value closest to that row's model-predicted value. I believe I need to use lapply, but I'm really not sure. Thanks in advance, SE, and sorry if this is a repeat of a previous post; I did look.

df <- data.frame(pred = rnorm(50, mean = 100, sd = 10),
                 cand = I(replicate(50, exp = I(list(rnorm(6, mean = 100, sd = 10))))))

So far, I've come up with a 1-line function that works when run on a single row, but I have two problems:

df$closest <- sapply( df, function(x) { which.min( abs( df$pred[x] - df$cand[[x]] ) ) } )

This function won't work on the full list, probably because I am new to the apply family.
This function returns a list position, not the actual value, which is what I need.

Error in df$cand[[x]] : no such index at level 1

Badger · Accepted Answer

apply lets us operate on either the rows or the columns of a data frame. Because you want to loop through the rows, a MARGIN of 1 (rows) should get the job done!

We could use apply:

df$closest <- apply(df, MARGIN = 1, function(x) { which.min(abs(x$pred - x$cand)) })
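
Note that which.min gives the position of the closest candidate, while the question asks for the value itself. Here is a minimal sketch of one way to get the value, using mapply to walk pred and cand in parallel instead of apply. The set.seed call, the simplified construction of cand, and the column names closest and closest_idx are my own assumptions for illustration, not part of the original post.

```r
set.seed(42)  # assumption: seed added only so the sketch is reproducible

# Same shape as the question's data: one prediction and six candidates per row.
df <- data.frame(pred = rnorm(50, mean = 100, sd = 10))
df$cand <- replicate(50, rnorm(6, mean = 100, sd = 10), simplify = FALSE)

# Pair each prediction with its candidate vector and return the candidate
# with the smallest absolute difference (the value, not the position).
df$closest <- mapply(
  function(p, cands) cands[which.min(abs(p - cands))],
  df$pred, df$cand
)

# If the position is also wanted, loop over row indices rather than over df.
# (sapply(df, ...) iterates over the columns, not the rows.)
df$closest_idx <- vapply(
  seq_len(nrow(df)),
  function(i) which.min(abs(df$pred[i] - df$cand[[i]])),
  integer(1)
)
```

mapply keeps pred numeric and cand as a list, so nothing has to be coerced to a matrix first, which is what apply does with a data frame.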
