Generate group by condition on row value in column R data.table

Question

I want to split a data.table in R into groups based on a condition in the value of a row. I have searched SO extensively and can't find an efficient data.table way to do this (I'm not looking to for loop across rows)

I have data like this:

library(data.table)
dt1 <- data.table( x=1:139, t=c(rep(c(1:5),10),120928,rep(c(6:10),9), 10400,rep(c(13:19),6)))

I'd like to group at the large numbers (over a settable value) and come up with the example below:

dt.desired <- data.table( x=1:139, t=c(rep(c(1:5),10),120928,rep(c(6:10),9), 10400,rep(c(13:19),6)), group=c(rep(1,50),rep(2,46),rep(3,43)))

dayne · Accepted Answer

dt1[ , group := cumsum(t > 200) + 1]

dt1[t > 200]
#     x      t group
# 1: 51 120928     2
# 2: 97  10400     3
dt.desired[t > 200]
#     x      t group
# 1: 51 120928     2
# 2: 97  10400     3

Generate group by condition on row value in column R data.table

Answers (2)

Related Questions