Run a function across multiple columns

Question

I'm trying to clean a sixty column data extract that has been given to me. Part of the data is about thirty columns which have been supplied as "Yes" or "No" values that I would like to convert to logical type. It therefore isn't every column in the data frame, but it is a lot of them. I'm currently doing the equivalent of this:

mtcars %>%
  mutate(mpg = as.character(mpg)) %>%
  mutate(cyl = as.character(cyl)) %>%
  mutate(disp = as.character(disp)) %>%
  mutate(hp = as.character(hp))

That is, manually mutating each column in the list. But that feels like it will be prone to error from missing a copy-paste or similar. Is there a function that could do this in one step by being passed a list of field names? I tend to default to tidyverse functions, though base R also works if needed.

Ronak Shah · Accepted Answer

This should be a duplicate but cannot find a relevant post right now.

We can use mutate_at and apply function on selected columns

library(dplyr)
mtcars %>% mutate_at(vars(mpg, cyl, disp, hp), as.character)

Or if we have column names stored in vector called cols we could do

cols <- c("mpg", "cyl", "disp", "hp")
mtcars %>% mutate_at(cols, as.character)

Run a function across multiple columns

Answers (2)

Related Questions