advanced row deleting in R

Question

I am looking to delete rows in R based on more advanced selection logic (i.e. not just a simple subset). Here is some sample code, followed by what I need to do:

v1 <- c(1:11)
v2 <- c('a','a','b','b','b','b','c','c','c','c','c')
v3 <- c(3,13,14,13,14,9,14,13,14,13,14)
v4 <- c('','x','','','','x','','','','','x')
v5 <- c('','x','','y','','x','','y','','y','x')

test.df <- data.frame(v1,v2,v3,v4,v5)
names(test.df) <- c('id','level','number','end_flag','logic_flag')

What I want to do is, within each level, remove all the rows below the first row where logic_flag is equal to 'y'.

So in this case, the end result should remove no rows for level a, rows 5 and 6 for level b, and rows 9,10,11 for level c.

Basically, I want to treat the first '13' that comes up in the number column for each level as the end of that level: set its end_flag to 'x', and then delete all the rows for that level underneath it. Let me know if this makes sense, as I need to clean this part up before proceeding with the rest of my code!

Thanks!

thelatemail · Accepted Answer

Base R, using cumsum twice within each level (grouped with ave):

posty <- function(x) cumsum(cumsum(x))<=1
test.df[with(test.df, ave(logic_flag=="y", level, FUN=posty)),]

#  id level number end_flag logic_flag
#1  1     a      3                    
#2  2     a     13        x          x
#3  3     b     14                    
#4  4     b     13                   y
#7  7     c     14                    
#8  8     c     13                   y
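
To see why the double cumsum works, it may help to trace it on a single level. The sketch below rebuilds the question's sample data and steps through level 'c'. The names is_y, keep, result and first_y are only illustrative. The last step, setting end_flag to 'x' on the first 'y' row of each level, is an assumption about what the question asks for and is not part of the one-liner above.

```r
# Rebuild the sample data from the question.
test.df <- data.frame(
  id         = 1:11,
  level      = c('a','a','b','b','b','b','c','c','c','c','c'),
  number     = c(3,13,14,13,14,9,14,13,14,13,14),
  end_flag   = c('','x','','','','x','','','','','x'),
  logic_flag = c('','x','','y','','x','','y','','y','x'),
  stringsAsFactors = FALSE
)

# Trace the helper on level 'c' alone.
is_y <- test.df$logic_flag[test.df$level == 'c'] == 'y'
is_y                        # FALSE TRUE FALSE TRUE FALSE
cumsum(is_y)                # 0 1 1 2 2  (count of 'y' seen so far)
cumsum(cumsum(is_y))        # 0 1 2 4 6  (goes above 1 right after the first 'y')
cumsum(cumsum(is_y)) <= 1   # TRUE TRUE FALSE FALSE FALSE

# Same logic applied per level, as in the answer.
posty  <- function(x) cumsum(cumsum(x)) <= 1
keep   <- with(test.df, ave(logic_flag == 'y', level, FUN = posty))
result <- test.df[keep, ]

# Assumed extra step: mark the first 'y' row of each level as its new end.
# After the filter, any remaining 'y' row is the first one in its level.
first_y <- result$logic_flag == 'y'
result$end_flag[first_y] <- 'x'

result$id                   # 1 2 3 4 7 8
```

The first cumsum counts how many 'y' flags have been seen; the second turns that into a running total that stays at or below 1 only up to and including the first 'y'. A level with no 'y' at all (level a here) never leaves 0, so all of its rows are kept.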
