rbinom (or else) by row and group in data.table

Question

I want to draw Y for each row in group g in a data.table. To illustrate, take this simple simulation:

set.seed(123)

N = 50
g = 3

DT = data.table(id = rep(1:N,g),
                group_id = sort(rep(1:g, N)),
                p = runif(150, min = 0, max = 1)
                )

DT[, Y := rbinom(n = .N, size = 1, prob = p), by = group_id]  

DTs = split(DT, by = "group_id")

DTs = rbindlist(DTs[1], idcol=T)

DTs[, Y := rbinom(n = .N, size = 1, prob = p)]

head(DT)
head(DTs)

Why does Y differ for DT and DTs? I thought this to be equivalent.

> head(DT)
   id group_id         p Y
1:  1        1 0.2875775 1
2:  2        1 0.7883051 1
3:  3        1 0.4089769 0
4:  4        1 0.8830174 1
5:  5        1 0.9404673 1
6:  6        1 0.0455565 0


> head(DTs)
   .id id group_id         p Y
1:   1  1        1 0.2875775 1
2:   1  2        1 0.7883051 1
3:   1  3        1 0.4089769 1
4:   1  4        1 0.8830174 1
5:   1  5        1 0.9404673 1
6:   1  6        1 0.0455565 0

rbinom (or else) by row and group in data.table

Answers (1)

Related Questions