SQL a cumulative distinct count

Question

I have a SQL table that lists individual events and I am trying to aggregate to get a group of events as follows.

id |Name | Date|
0  |A    |2018-05-08
1  |A    |2018-05-09
2  |B    |2018-05-11
3  |B    |2018-05-12
4  |A    |2018-05-17
5  |A    |2018-05-17
6  |A    |2018-05-18
7  |C    |2018-05-25
8  |C    |2018-05-26
9  |B    |2018-05-27

Becomes:

Name|Group
|A  |1
|B  |2
|A  |3
|C  |4
|B  |5

This I believe is some form of a Count(), then OVER BY, which have always tripped me up. I do not know what I would even count over because there is little grouping these Names together. So far, I have the following:

select
    Name
    ,Count(Name)
from table
Group BY
    Name

Gordon Linoff · Accepted Answer

There is no reason to think of this as a gap-and-islands problem. I mean, it is, but there is a simpler solution.

In this case, use lag() and row_number():

select name, row_number() over (order by date, id) as grp
from (select t.*,
             lag(name) over (order by date, id) as prev_name
      from t
     ) t
where prev_name is null or prev_name <> name;

SQL a cumulative distinct count

Answers (2)

Related Questions