Using lapply to sum a subset of a dataframe

Question

I'm quite new to R and using lapply. I have a large dataframe and I'm attempting to use lapply to output the sum of some subsets of this dataframe.

group_a	group_b	n_variants_a	n_variants_b
1	NA	1	2
NA	2	5	4
1	2	2	0

I want to look at subsets based on multiple different groups (group_a, group_b) and sum each column of n_variants.

Running this over just one group and n_variant set works:

sum(subset(df, (!is.na(group_a)))$n_variants_a

However I want to sum every n_variant column based on every grouping. My lapply script for this outputs values of 0 for each sum.

summed_variants <- lapply(list_of_groups, function(g) {
              lapply(list_of_variants, function(v) {
                sum(subset(df, !(is.na(g)))$v)

I was wondering if I need to use paste0 to paste the list of variants in, but I couldn't get this to work.

Thanks for your help!

Using lapply to sum a subset of a dataframe

Answers (1)

data

Related Questions