Running an analysis over each column of a data.table

Question

I have a very large data table which consists of a response variable which can be zero (failure) or 1 success. In the data.table there follow very many columns, each one represents a factor by which the response rate may vary. I would like to run an identical analysis over each column of the data which will produce various things, for example a table of the response rate by that column, a histogram and so on.

An example of the table I would like to produce is shown below:

data.dt <- data.table(response=round(runif(100,0,1),0), 
                      factor1 = sample(LETTERS,100, replace=TRUE),
                      factor2 = sample(letters,100, replace=TRUE)
                     )
setkey(data.dt, factor1)
table.dt <- data.dt[,list(patients=.N,
                          successes=sum(response)
                         ), 
                     by=list(factor1)
                    ]
table.dt[,successesRate  := round(successes/patients,2)] 
table.dt

Just sticking to the table for now, how can I run this table over each column of the data.table (in this case over factor1 and factor2) and store the resulting tables separately (possibly in a list) for further reference?

Many thanks.

Running an analysis over each column of a data.table

Answers (1)

Related Questions