Indicate whether each value exists before

Question

While doing my data work I have this problem.
I have customer id, receipt_id and product_id. The product_id indicates the products that the given customer purchased at the given receipt.
The data is sorted by customer id and receipt_id. The lower value of receipt_id means the earlier shopping trip.

For each product, I want to create dummy variable that indicate whether each product is purchased in past shopping trip (in previous receipt id).
I have first three columns and want to create 4th column, "purchased_before".

I can do it by using for loop but is there any efficient way?

Data is as below,

customer id      receipt_id   product_id     purchased_before
    1             1               113                 0
    1             1               114                 0
    1             2               113                 1
    1             2               116                 0
    1             2               346                 0
    1             3               421                 0
    1             3               114                 1
    1             3               421                 0
    ....
    2             1               213                 0
    2             1               114                 0
    2             2               113                 0
    2             2               116                 0
    2             2               346                 0
    2             3               113                 1
    2             3               114                 1
    2             3               421                 0
    ....

Indicate whether each value exists before

Answers (1)

Related Questions