SAS/SQL Choose random row by group

Question

I've seen a fair number of different solutions to this using MYSQL but some of the functions that they use don't work within SAS. I have a very large list of customer records that looks like this:

DATE      ID    .........  
1/31/13   1    ............  
1/31/13   2    ...........  
2/28/13   1    ............  
3/31/13   1   ............  
3/31/13   2    ............

And I need to select only 1 row per ID group. The thing is I don't want to just choose the first date for each I want to choose a random or arbitrary(whichever is easier to do) row from the available options. Does anybody know a way I could do this?

Michael Richardson · Accepted Answer

Barebones solution:

Proc SurveySelect data=Have out=Want noprint
    Method = urs
    N = 1
    outhits
    rep = 1;
    Strata Id ;
run;

Depending on your goal, the answer could change slightly. If you're bootstrapping (which it appears you may be), I found this paper quite useful: Don't Be Loopy: Re-Sampling and Simulation the SAS® Way

SAS/SQL Choose random row by group

Answers (2)

Related Questions