Conditionally impute values in a group based on existing values in R

Question

After grouping my data, I would like to impute values of a column in both directions based on one or more existing values. I have the following data:

id   inw    test    found
001    0       1     
001    1       2     
001    1       3       No
001    1       4       No
002    1       1       No
002    0       2     
002    1       3      Yes
002    1       4       No
003    1       1      Yes
003    1       2      Yes
003    1       3       No
003    0       4

The data is grouped by id and the found column is in question here. Essentially, if a value of 'No' appears within a group, all previous values should be changed to 'No'. If 'Yes' appears, all subsequent values should change to 'Yes'.

Therefore, the final result should look like this:

id   inw    test    found
001    0       1       No
001    1       2       No
001    1       3       No
001    1       4       No
002    1       1       No
002    0       2     
002    1       3      Yes
002    1       4      Yes
003    1       1      Yes
003    1       2      Yes
003    1       3      Yes
003    0       4      Yes

Critically, NA's can remain if there is not a 'No' value anywhere after it it, or no 'Yes' values anywhere before it within the group (highlighted above in group id == 002).

Any help would be greatly appreciated!

Thanks, Dylan

ekoam · Accepted Answer

Something like this?

library(dplyr)

fill_na <- function(x) {
  if (length(first_yes <- head(which(x == "Yes"), 1L)) > 0L)
    x[seq.int(first_yes, length(x), 1L)] <- "Yes"
  if (length(last_no <- tail(which(x == "No"), 1L)) > 0L)
    x[seq.int(1L, last_no, 1L)] <- "No"
  x
}

df %>% group_by(id) %>% mutate(found = fill_na(found))

Output

# A tibble: 12 x 4
# Groups:   id [3]
   id      inw  test found
      
 1 001       0     1 No   
 2 001       1     2 No   
 3 001       1     3 No   
 4 001       1     4 No   
 5 002       1     1 No   
 6 002       0     2 NA   
 7 002       1     3 Yes  
 8 002       1     4 Yes  
 9 003       1     1 Yes  
10 003       1     2 Yes  
11 003       1     3 Yes  
12 003       0     4 Yes

Conditionally impute values in a group based on existing values in R

Answers (2)

Related Questions