Changing values on data subset based on conditions R

Question

I ran into an issue when I am trying to manually change some values

Here is my data set

dat <- read.table(text='
id  Item                                                 Category    Next_Category
 1  "CRANBERRY 10PKTS CARTON, BLUEBERRY 20PKTS CARTON"   2           2
 1  "CRANBERRY 10PKTS CARTON, BLUEBERRY 20PKTS CARTON"   2           1
 1  "CRANBERRY 10PKTS CARTON"                            1           1
 1  "CRANBERRY 10PKTS CARTON"                            1           2
 1  "CRANBERRY 10PKTS CARTON, BLUEBERRY 20PKTS CARTON"   2           NA
', header=TRUE)

You can see that row 3 and row 4 has Category of 1. The conditions would be that row 3 and row 4 has values that can be found in the previous row (row 2), and that they continue to the next row (row 5). If so, they actually belong to Category 2 instead of Category 1 (yeah I know it is strange, but this is a requirement to treat them as the same).

I have multiple ids. I would like to only identify this kind of subset of data to achieve the desired outcome.

I have experimented with the idea of taking lag values of the Category to create an identifier on every decrease in the number from the Category. Let's ignore the scenario where there is an increase in the number from the Category first.

Expected output would be:

id  Item                                                 Category    Next_Category
 1  "CRANBERRY 10PKTS CARTON, BLUEBERRY 20PKTS CARTON"   2           2
 1  "CRANBERRY 10PKTS CARTON, BLUEBERRY 20PKTS CARTON"   2           1
 1  "CRANBERRY 10PKTS CARTON"                            2           1
 1  "CRANBERRY 10PKTS CARTON"                            2           2
 1  "CRANBERRY 10PKTS CARTON, BLUEBERRY 20PKTS CARTON"   2           NA

Many thanks in advance!

Changing values on data subset based on conditions R

Answers (1)

data

Related Questions