Reputation: 437
I'm working with a large data table in R, and am trying to loop over the entire table and set row values in a given column based on the previous row's value in a separate column.
I'm attempting to run this loop on a table with 200K rows, and it's moving very slowly. I suspect I'm not taking advantage of all data.table's efficiencies, but don't know where I might improve things.
My code's below. My table is "DATA", my keys are columns "x" and "y", and I'm attempting to loop through all rows and set the value of rows in column 6 to 1 only if that row's value in column 2 is not equal to the previous row's value in column 2.
setkey(DATA, x, y)
for (i in 2:nrow(DATA)) {
  if (DATA[i, 2] != DATA[i - 1, 2]) {
    DATA[i, 6] <- 1
  }
}
Again, this works, but is very slow for large tables. Any help would be much appreciated -- thank you!
Upvotes: 2
Views: 3857
Reputation: 3866
Think vectors, not loops:
DATA[, 6] <- c(0, as.numeric(diff(DATA[, 2]) != 0))
I've put 0 in the first row because I don't know what else to put there, but you can change it to something else if it's more appropriate.
Upvotes: 3
Reputation: 42689
Without seeing data, here's a stab (which does not use data.table):
DATA[c(0, diff(DATA[, 2])) != 0, 6] <- 1
If the first row is considered "not equal":
DATA[c(1, diff(DATA[, 2])) != 0, 6] <- 1
Upvotes: 7
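Since the question asks about data.table specifically, here is a sketch of the same idea using data.table's own idioms (assuming data.table >= 1.9.6 for shift(); the column names grp and flag are placeholders for the question's columns 2 and 6):

```r
library(data.table)

# Example table; "grp" stands in for the question's column 2
# and "flag" for column 6 (names are assumptions).
DATA <- data.table(grp = c(1, 1, 2, 2, 3), flag = 0)

# Set flag to 1 wherever grp differs from the previous row's grp.
# shift() gives the lagged column; := updates by reference, so the
# 200K-row table is modified in place rather than copied.
DATA[grp != shift(grp), flag := 1]

print(DATA)
```

The first row's comparison is NA (there is no previous row), so that row is left untouched, which matches the original loop starting at i = 2.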