New column with row difference based on condition

Question

I would like to create a new column difference in a data.frame according to condition, for example I have this data.frame :

    structure(list(ID = c(1, 1, 2, 2), Condition = c("a", "b", "a", 
"b"), Value = c(20, 30, 50, 45)), class = "data.frame", row.names = c(NA, 
-4L))

  ID Condition Value
1  1         a    20
2  1         b    30
3  2         a    50
4  2         b    45

Then for each ID, I would like to obtain a new column with Value when Condition = a and Value difference b-a when Condition = b. On other words, I would like to obtain this but I'm struggling :

  ID Condition Value Diff
1  1         a    20   20
2  1         b    30   10
3  2         a    50   50
4  2         b    45   -5

How would you proceed to do this ? Thanks

Karthik S · Accepted Answer

Will this work:

library(dplyr)
df %>% 
  arrange(ID, Condition) %>% 
   mutate(Diff = case_when(Condition == 'a' ~ Value, 
                            TRUE ~ Value - lag(Value)))

   ID Condition Value Diff
1  1         a    20   20
2  1         b    30   10
3  2         a    50   50
4  2         b    45   -5

New column with row difference based on condition

Answers (2)

Related Questions