R - Creating a new variable based on multiple observations

Question

My dataset represents patients which have been treated multiple times. The dataset is in a long format, patients either get treatment A, C or S or a combination. A and C are never combined.

Simply put, the data looks something like this:

df <- tibble(PatientID = c(1,1,1,2,2,3,3,3,3,4,4,5,5,5,6,6),
             treatment = c("A", "A", "S", "C", "S", "S", "C", "C", NA, "C", NA, NA, "S", "A", "S", NA)

I would like to creat a new variable based on if any patient had treatment A or C or neither, so the end result looking something like:

df <- tibble(PatientID = c(1,1,1,2,2,3,3,3,3,4,4,5,5,5,6,6),
             treatment = c("A", "A", "S", "C", "S", "S", "C", "C", NA, "C", NA, NA, "S", "A", "S", "S"),
             group = c("A", "A", "A", "C", "C", "C", "C", "C", "C", "C", "C", "A", "A", "A", "S", "S"))

How can I best approach this? I'm struggling with how to deal with multiple observations per ID.

Thank you!

R - Creating a new variable based on multiple observations

Answers (1)

Related Questions