Sequentially store the results of a series of regressions into a dataframe

Question

Suppose I want to run a series of regressions, like so:

summary(lm(mpg ~ cyl, data = mtcars))
summary(lm(mpg ~ disp, data = mtcars))
summary(lm(mpg ~ wt, data = mtcars))

I want to create a data frame that contains the estimates and standard errors of each of these outputs, preferably with the variable name included. So the ultimate output should look like this:

Variable  Beta  SE
cyl       -2.8  .32
disp      -.04  .004
wt        -5.3  .56

I presume it will require a function. Any ideas out there?

MrFlick · Accepted Answer

One easy way would be to use the purrr and broom packages in the tidyverse.

library(purrr)
library(broom)
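# Predictors to regress mpg on, one model per variable
cols <- c("cyl", "disp", "wt")

# reformulate(.x, "mpg") builds the formula mpg ~ <variable>;
# tidy() turns each fit into a data frame, and map_df() row-binds them
map_df(cols, ~lm(reformulate(.x, "mpg"), data=mtcars) %>% tidy())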
#   term        estimate std.error statistic  p.value
#   <chr>          <dbl>     <dbl>     <dbl>    <dbl>
# 1 (Intercept)  37.9      2.07        18.3  8.37e-18
# 2 cyl          -2.88     0.322       -8.92 6.11e-10
# 3 (Intercept)  29.6      1.23        24.1  3.58e-21
# 4 disp         -0.0412   0.00471     -8.75 9.38e-10
# 5 (Intercept)  37.3      1.88        19.9  8.24e-19
# 6 wt           -5.34     0.559       -9.56 1.29e-10

This gives you some extra info but you could easily filter it out with dplyr if you like.
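For example, the filtering step might look something like the sketch below. It assumes dplyr is installed alongside purrr and broom. The object names `results` and `out` and the output column names `Variable`, `Beta`, and `SE` are my own labels, chosen to match the layout asked for in the question.

```r
library(purrr)
library(broom)
library(dplyr)

cols <- c("cyl", "disp", "wt")

# Fit one simple regression per predictor and stack the tidy() summaries
results <- map_df(cols, ~ tidy(lm(reformulate(.x, "mpg"), data = mtcars)))

# Drop the intercept rows, then keep and rename only the requested columns
# (Variable, Beta, SE are illustrative names, not part of broom's output)
out <- results %>%
  filter(term != "(Intercept)") %>%
  select(Variable = term, Beta = estimate, SE = std.error)

out
```

This should leave one row per predictor, holding just the slope estimate and its standard error.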
