How to convert lapply output to a single matrix in R

Question

I have a list of data frames, organized by year. I am using lapply to get the summary for a single variable in each data frame. The output follows the list and gives a summary for each year, one by one. However, I want the output in the form of a single table with years for rows. How do I do this? An example using the iris dataset shows my problem:

x <- split(iris$Sepal.Length, iris$Species)
lapply(x, summary)

And the output is:

$setosa
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
   4.300   4.800   5.000   5.006   5.200   5.800

Similarly for the other two.

I want the output organized as a single table, like the one sapply gives:

> sapply(x, summary)
        setosa versicolor virginica
Min.     4.300      4.900     4.900
1st Qu.  4.800      5.600     6.225
Median   5.000      5.900     6.500
Mean     5.006      5.936     6.588
3rd Qu.  5.200      6.300     6.900
Max.     5.800      7.000     7.900

But I want setosa, versicolor, and virginica (or years, in my case) down the left and Min. through Max. across the top. I can flip the axes around in ggplot, but reading the table as-is is more intuitive with the years on the left. I found a number of discussions about converting lapply output, but they all computed a single statistic such as the mean or median. Thanks.

Rich Scriven · Accepted Answer

This seems like a good time to use by(). It eliminates the need for the call to split(), and combined with do.call(rbind, ...) it does everything in one line and returns a matrix.

with(iris, do.call(rbind, by(Sepal.Length, Species, summary)))
#            Min. 1st Qu. Median  Mean 3rd Qu. Max.
# setosa      4.3   4.800    5.0 5.006     5.2  5.8
# versicolor  4.9   5.600    5.9 5.936     6.3  7.0
# virginica   4.9   6.225    6.5 6.588     6.9  7.9

If you still wish to use the manual split-apply-combine method, then it would be

do.call(rbind, lapply(x, summary))
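
For the situation described in the question (a list of data frames, one per year), the same idea might look like the sketch below. The list `dfs`, its year names, and the column name `value` are made up for illustration and are not from the original post. Transposing the sapply() result with t() is shown as an alternative that should give the same layout.

```r
# Hypothetical data: a list of data frames named by year,
# each with a numeric column called `value` (names are assumptions).
set.seed(1)
dfs <- list(
  "2001" = data.frame(value = rnorm(10)),
  "2002" = data.frame(value = rnorm(10)),
  "2003" = data.frame(value = rnorm(10))
)

# Summarise one column per data frame, then stack the summaries row-wise.
# Row names come from the list names, column names from summary().
res <- do.call(rbind, lapply(dfs, function(d) summary(d$value)))

# Alternative: sapply() puts the statistics in rows, so transpose with t().
res2 <- t(sapply(dfs, function(d) summary(d$value)))

print(res)
```

One caveat: if some of the vectors contain missing values, summary() adds an NA's entry for those vectors only, so the pieces no longer have the same length and will not line up cleanly.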
