Recursively loop through a SQL table and find intervals based on Start and End Dates

Question

I have a SQL table that contains EmployeeID, StartDateTime and EndDateTime as follows:

CREATE TABLE Sample
(
    SNO INT,
    EmployeeID NVARCHAR(10),
    StartDateTime DATE,
    EndDateTime DATE
)

INSERT INTO Sample
VALUES
( 1, 'xyz', '2018-01-01', '2018-01-02' ), 
( 2, 'xyz', '2018-01-03', '2018-01-05' ), 
( 3, 'xyz', '2018-01-06', '2018-02-01' ), 
( 4, 'xyz', '2018-02-15', '2018-03-15' ), 
( 5, 'xyz', '2018-03-16', '2018-03-19' ),
( 6, 'abc', '2018-01-16', '2018-02-25' ),
( 7, 'abc', '2018-03-08', '2018-03-19' ),
( 8, 'abc', '2018-02-26', '2018-03-01' )

I want the result to be displayed as

EmployeeID  |  StartDateTime  |  EndDateTime
------------+-----------------+---------------
   xyz      |  2018-01-01     |  2018-02-01
   xyz      |  2018-02-15     |  2018-03-19
   abc      |  2018-01-16     |  2018-03-01
   abc      |  2018-03-08     |  2018-03-19

Basically, I want to recursively look at the records of each employee, determine which start and end dates are continuous, and merge them into a set of continuous date ranges.

I wrote my query as follows:

SELECT *
FROM Sample t1
LEFT JOIN Sample t2 ON t2.EmployeeID = t1.EmployeeID
WHERE t1.EndDateTime = DATEADD(DAY, -1, t2.StartDateTime)

to see whether I could spot a pattern in the output. I later realized that with this approach I would need to join the table to itself multiple times to get the output I want.

Also, there can be multiple employee records, so I need direction on an efficient way of getting the desired output.

Any help is greatly appreciated.

TomC · Accepted Answer

This will do it for you. Use a recursive CTE to chain adjacent rows (a row is adjacent when it starts the day after the previous one ends), then take the highest end date for each start date, then the first start date for each end date.

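;with cte as (
    -- anchor: every row starts as its own candidate interval
    select EmployeeID, StartDateTime, EndDateTime
    from Sample s
    union all
    -- recursive step: extend an interval with the row that starts the day after it ends
    select cte.EmployeeID, cte.StartDateTime, s.EndDateTime
    from Sample s
    join cte on cte.EmployeeID = s.EmployeeID and s.StartDateTime = dateadd(d, 1, cte.EndDateTime)
)
-- outer query: first start date for each end date
select EmployeeID, Min(StartDateTime) as StartDateTime, EndDateTime from (
    -- inner query: highest end date reachable from each start date
    select EmployeeID, StartDateTime, Max(EndDateTime) as EndDateTime from cte
    group by EmployeeID, StartDateTime
) q group by EmployeeID, EndDateTime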
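
For comparison, since the question asks about efficiency, the same grouping can be sketched without recursion using window functions. This is not part of the query above, only a rough alternative under some assumptions: SQL Server 2012 or later (for LAG and a windowed SUM), the Sample table from the question, and no overlapping rows for the same employee. The names flagged, grouped, IsNewInterval and IntervalNo are made up for illustration.

```sql
-- Sketch only: assumes SQL Server 2012+ and that an employee's rows never overlap.
WITH flagged AS (
    SELECT EmployeeID, StartDateTime, EndDateTime,
           -- 0 when this row starts the day after the previous row ends, else 1
           -- (the first row per employee has no previous row, so it gets 1)
           CASE WHEN StartDateTime = DATEADD(DAY, 1,
                         LAG(EndDateTime) OVER (PARTITION BY EmployeeID
                                                ORDER BY StartDateTime))
                THEN 0 ELSE 1 END AS IsNewInterval
    FROM Sample
),
grouped AS (
    SELECT EmployeeID, StartDateTime, EndDateTime,
           -- running count of interval starts = interval number per employee
           SUM(IsNewInterval) OVER (PARTITION BY EmployeeID
                                    ORDER BY StartDateTime
                                    ROWS UNBOUNDED PRECEDING) AS IntervalNo
    FROM flagged
)
SELECT EmployeeID,
       MIN(StartDateTime) AS StartDateTime,
       MAX(EndDateTime)   AS EndDateTime
FROM grouped
GROUP BY EmployeeID, IntervalNo
ORDER BY EmployeeID, MIN(StartDateTime);
```

On the sample data this should give the same four intervals as in the question. It reads each row once per window pass instead of building every chain of adjacent rows, which may matter on larger tables, but it has not been benchmarked here.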
