SAS : Eliminate duplicates if a condition is satisfied

Question

I want to eliminate duplicates from a dataset, based on an identifier, an order and a condition.

More precisely, my data contains several observations per identifier. Sometimes an observation satisfies a condition that makes me want to keep it no matter what (say condition=1), and in that case I also want to keep the other observations with the same identifier, even if the condition does not hold for them (condition=0).

But if an identifier has several observations with condition=0, then I want to eliminate the duplicates, keeping the observation with the greatest order.

Without the condition, I can do this:

proc sort data=have;
    by identifier descending order;
run;

proc sort nodupkey data=have;
    by identifier;
run;

But how can I incorporate my condition into this?

Edit 1: added an example dataset:

data Test; 
   input identifier $ order condition; 
   datalines;
1023 1 0
1023 2 0
1064 2 0
1064 1 0
1098 1 0
1098 1 1
;

Then I want to keep:

1023 2 0
1064 2 0
1098 1 0
1098 1 1

Edit 2: tried to clarify my conditions.

Chris · Accepted Answer

I presume you want to eliminate duplicates only when the condition for all records for an identifier is set to 0. In that case you want to keep the record with the maximum order and eliminate all other records with the same identifier.

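proc sql;
    create table want as
    select *
    from test
    group by identifier
    /* keep every row of an identifier that has at least one condition=1; */
    /* otherwise keep only the row(s) with the greatest order */
    having max(condition) ne 0
        or order eq max(order);
quit;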

This will keep all rows for an identifier where the maximum condition = 1. For identifiers where the maximum condition = 0, it selects the row with the maximum order.

Is that what you want?
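
If a DATA step is preferred over PROC SQL, the same rule can be sketched with two passes over each BY group. This is only an illustration under the interpretation above, not a tested replacement: the dataset names sorted and want_ds and the helper variable any_flag are made up for the example.

```sas
/* Sort so that, within each identifier, the greatest order comes last. */
proc sort data=test out=sorted;
    by identifier order;
run;

data want_ds;
    /* First pass over the BY group: does any row have condition=1?  */
    /* any_flag is reset to missing at the start of each group, and  */
    /* max() ignores missing values.                                  */
    do until (last.identifier);
        set sorted;
        by identifier;
        any_flag = max(any_flag, condition);
    end;
    /* Second pass over the same group: keep every row if the flag   */
    /* is set, otherwise keep only the last row (greatest order).    */
    do until (last.identifier);
        set sorted;
        by identifier;
        if any_flag = 1 or last.identifier then output;
    end;
    drop any_flag;
run;
```

On the Test data from the question this should keep the same four rows. One difference from the SQL version: when several condition=0 rows tie for the greatest order, the SQL query keeps all of them, while this sketch keeps only the last one in sort order.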
