Delete group when its observation does not contain certain values in SAS

Question

Refer to below table, an ID is considered as complete if at least one of its group having Day 1 to Day 3 (Duplicate allowed). I need to remove ID which has Group not having full Day 1 to Day 3.

ID   Group     Day
1     A        1
1     A        1
1     A        2
1     A        3
1     B        1
1     B        3
2     A        1
2     A        3
2     B        2

Expected result

ID   Group     Day
1     A        1
1     A        1
1     A        2
1     A        3
1     B        1
1     B        3

With this reference, Delete the group that none of its observation contain the certain value in SAS I have tried below code but it cannot remove ID 2.

PROC SQL;
CREATE TABLE TEMP AS SELECT
* FROM HAVE
GROUP BY ID
HAVING MIN(DAY)=1 AND MAX(DAY)=3
;QUIT;

PROC SQL;
CREATE TABLE TEMP1 AS SELECT
* FROM TEMP WHERE ID IN
(SELECT ID FROM TEMP
WHERE DAY=2)
;QUIT;

Tom · Accepted Answer

So you want to find the set of ID values where the ID has at least one GROUP that has all three DAY values? Find the list of IDs as a subquery and use it to subset the original data.

The key thing in subquery is you want there to be 3 distinct values of DAY. If your data could have other values of DAY (like missing or 4) then use a WHERE clause to only keep the values you want to count.

proc sql;
create table want as
  select * from have
  where id in 
   (select id from have 
    where day in (1,2,3)
    group by id,group
    having count(distinct day)=3
   )
;
quit;

Delete group when its observation does not contain certain values in SAS

Answers (2)

Related Questions