aggregating regression outputs in R

Question

I am performing multiple pooled cross-sectional regressions with a loop function and stored the regression outputs in a list (regression). What i would like to do now is to efficiently obtain the average coefficients, the average t-stats as well as the average adj.r squared.

I put up the following code already:

library(plm)
data("Grunfeld", package="plm")

# create list with regression outputs
regression <- list()

# Regression on past six-year subsets of Grunfeld in every year from 1940 to 1950
for(t in 1940:1950){

  regression[[as.character(t)]] <- lm(inv ~ value + capital, 
                              subset(Grunfeld, year<=t & year>=t-5))
}

This way I obtain the desired regression output stored in a list (regression). What i would like to do now is to efficiently obtain the average coefficients, the average t-stats as well as the average adj.r squared.

I already tried to calculated the mean of all the adj. r squared with the following:

mean(lapply(regression, function(x) summary(x)$adj.r.squared))

However it seems that I am using the mean function wrong as i get the following error.

Warning message:
In mean.default(lapply(regression, function(x) summary(x)$adj.r.squared)) :
  argument is not numeric or logical: returning NA

Further i came up with the following to "extract" the coefficients.

lapply(regression, function(x) summary(x)$coefficients)

How can I quickly obtain the average individual coefficients from this lapply output? (i.e extracting each row individually and calculate the respective mean over the years.)

$`1940`
               Estimate   Std. Error    t value     Pr(>|t|)
(Intercept) -3.65239712 14.647050149 -0.2493606 8.039783e-01
value        0.08283141  0.006873563 12.0507230 2.615793e-17
capital      0.11033307  0.091543522  1.2052526 2.330857e-01

$`1941`
                Estimate   Std. Error   t value     Pr(>|t|)
(Intercept) -13.77258038 16.677399231 -0.825823 4.123477e-01
value         0.08614094  0.007258571 11.867480 4.904857e-17
capital       0.18680229  0.094849038  1.969470 5.376624e-02

....

akrun · Accepted Answer

You could try:

 library(reshape2)
  dcast(melt(lapply(regression, 
       function(x) summary(x)$coefficients)), Var1~Var2, value.var="value", mean)
 #         Var1    Estimate Std. Error   t value     Pr(>|t|)
 #1 (Intercept) -16.7072859 16.0876958 -1.029145 3.320868e-01
 #2       value   0.1107460  0.0076057 14.599109 1.510115e-17
 #3     capital   0.1279743  0.0685896  1.833861 9.389504e-02

Or

 Reduce(`+`,lapply(regression, function(x) summary(x)$coefficients))/length(regression)
 #                    Estimate Std. Error   t value     Pr(>|t|)
 #(Intercept) -16.7072859 16.0876958 -1.029145 3.320868e-01
 #value         0.1107460  0.0076057 14.599109 1.510115e-17
 #capital       0.1279743  0.0685896  1.833861 9.389504e-02

aggregating regression outputs in R

Answers (2)

Related Questions