Selecting rows from a column which name is in a variable

Question

I have a dataset with a named column. The name of the column is also in a variable. I would like to be able to select all rows that satisfy a condition on the column "col".

Here I would like to select all rows whose value in the "col" column matches the condition "< 2".

name = "col"
dataset = data.frame(col = 1:3)

I tried to use "eval" in subset, or the "select" function of the dplyr package, but it does not do what I want (or I misused it).

Is there a simple way to do this?

Eric Burden · Accepted Answer

If you're new(ish) to R, I'm going to recommend using the tidyverse set of packages, including the ever-useful dplyr for problems like this so you can have more immediately readable and understandable code. You can get this package using install.packages('tidyverse'). With that installed, to answer your question:

library(dplyr)

df <- data.frame(
  col = c(0:10),
  another_col = c(10:20),
  third_col = c(25:35)
)

dynamic_name <- "col"

filter_at(df, dynamic_name, ~ .x < 2)

Note: The tidyverse family of packages typically accept the formula syntax (that ~ expression) as a way to introduce anonymous (lambda) functions, so ~ .x < 2 is a function that returns TRUE if the value passed in is less than 2).

Selecting rows from a column which name is in a variable

Answers (2)

Related Questions