How can I find union of ranges from a multiple group efficiently?

Question

I am creating a report of the common time ranges in which all the given processes executed concurrently. So far I have always drawn a graph and worked this out manually. Now I have more data and drawing a graph is no longer practical. As a computer engineer, I would like to solve this programmatically with an efficient algorithm.

So the question is basically,

There are N processes, and each has a list of time ranges during which it executed. What is the best way to find all the time intervals during which every one of the given processes was running?

Example:

Let's say there are three processes P1, P2, P3. The table below gives the execution time ranges of each process:

-|---------|--------------------|-
 | PROCESS |  TIME OF EXECUTION |
-|---------|--------------------|-
 |    P1   |   (0,4) , (5,10)   | **(updated from (0,1) , (5,10))
-|---------|--------------------|-
 |    P2   |   (0,6) , (1,8)    |
-|---------|--------------------|-
 |    P3   |   (3,10)           |
-|---------|--------------------|-

If I draw a graph using each process in the Y-axis and time-range in X-axis I can get something like this below, and the common range is highlighted with blue border color in it.

NOTE: P1 values are (0, 4) and (5,10)

From the graph, it's obvious the output/result is [ (3, 4), (5, 8) ].

I am looking for a solution/algorithm to deduce these results from the given input. I would appreciate an answer that addresses the following aspects of the problem:

A general theoretical solution for a huge number of processes & time ranges per process.
Is it a Dynamic Programming problem?
Is there any algorithm I can use in this scenario?
Is it similar to Largest Rectangle in Histogram problem?

Phenomenal One · Accepted Answer

This can be solved by maintaining a prefix sum and checking its count.

But before that, you need to do some basic preprocessing.

For each process, you need to merge its overlapping execution intervals.

This is a very common algorithm that you can look up.

So, after this step, your input becomes,

-|---------|--------------------|-
 | PROCESS |  TIME OF EXECUTION |
-|---------|--------------------|-
 |    P1   |   (0,4) , (5,10)   |
-|---------|--------------------|-
 |    P2   |   (0,8)            |
-|---------|--------------------|-
 |    P3   |   (3,10)           |
-|---------|--------------------|-

For P2 what we have done here is merge (0,6) and (1,8) and made it as (0,8).
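The merge step above could be sketched in Python as follows. This is only an illustration, not the answerer's code: the function name `merge_overlapping` and the `processes` dictionary are made up for the example, and it assumes each range is a `(start, end)` tuple with `start <= end`.

```python
def merge_overlapping(intervals):
    """Merge overlapping (start, end) ranges of a single process.

    Ranges that share at least one point are combined. Ranges separated
    by a gap, such as (0, 4) and (5, 10), are kept apart.
    """
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps or touches the previous range: extend that range.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Hypothetical container for the example input from the question.
processes = {
    "P1": [(0, 4), (5, 10)],
    "P2": [(0, 6), (1, 8)],
    "P3": [(3, 10)],
}
merged = {name: merge_overlapping(ranges) for name, ranges in processes.items()}
```

Sorting dominates the cost, so merging a process with k ranges takes O(k log k) time.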

Now, the next step is the actual algorithm to calculate prefix sum.

It works in the following way.

1. Initialize a map.
2. For each (start_time, end_time):
      map[start_time]++ and map[end_time + 1]--

We use (end_time + 1) because the process is still running at end_time. Only after end_time can we remove it from the count.
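As a rough sketch, building that map in Python might look like this. The name `build_delta_map` is hypothetical, and the code assumes integer time stamps and ranges that were already merged per process.

```python
from collections import defaultdict


def build_delta_map(merged_ranges_per_process):
    """Return {time: change in the number of running processes at that time}."""
    delta = defaultdict(int)
    for ranges in merged_ranges_per_process:
        for start, end in ranges:
            delta[start] += 1    # one more process is running from start
            delta[end + 1] -= 1  # it is no longer running after end
    return dict(delta)


# Merged ranges of P1, P2 and P3 from the table above.
delta = build_delta_map([[(0, 4), (5, 10)], [(0, 8)], [(3, 10)]])
print(sorted(delta.items()))
# [(0, 2), (3, 1), (5, 0), (9, -1), (11, -2)]
```

Note that key 5 ends up with value 0: P1 stops at 4 (so 5 gets -1) and starts again at 5 (so 5 gets +1).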

So, now we have our values.

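Now, we can start counting the prefix sums.

To do this, we initialize a counter sum = 0 and go through the keys of the map in sorted order.

num_processes = 3 is initialized as well. An intersection starts at the key where sum reaches num_processes, and it ends at key - 1 when sum drops below num_processes. If a key has val = 0 while sum stays at num_processes, a process stopped at key - 1 and started again at key, so we close the current intersection and open a new one.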

at 0,  val = 2,  sum = 2 (!= 3)      // (start = -1, end = -1)
at 3,  val = 1,  sum = 3             // intersection starts (start = 3, end = -1)
at 5,  val = 0,  sum = 3 (unchanged) // intersection ends and a new one starts: add (3,4), then (start = 5, end = -1)
at 9,  val = -1, sum = 2             // intersection ends: add (5,8) (start = -1, end = -1)
at 11, val = -2, sum = 0             // (start = -1, end = -1)

So the final result is [ (3,4), (5,8) ].
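Putting the whole walk-through together, a minimal Python sketch could look like the following. It is one possible reading of the steps above, not a definitive implementation: `common_intervals` is a made-up name, time stamps are assumed to be integers, and each process's ranges are assumed to be merged already.

```python
from collections import defaultdict


def common_intervals(merged_ranges_per_process):
    """Return the ranges during which every process is running."""
    n = len(merged_ranges_per_process)
    delta = defaultdict(int)
    for ranges in merged_ranges_per_process:
        for start, end in ranges:
            delta[start] += 1
            delta[end + 1] -= 1

    result = []
    running = 0        # prefix sum: number of processes running at time t
    open_start = None  # start of the intersection currently open, if any
    for t in sorted(delta):
        change = delta[t]
        was_full = running == n
        running += change
        # Close the open intersection if the count dropped below n, or if a
        # process stopped at t - 1 and resumed at t (net change of 0).
        if was_full and (running < n or change == 0):
            result.append((open_start, t - 1))
            open_start = None
        # Open an intersection when the count reaches n, or right after a
        # zero-change split.
        if running == n and (not was_full or change == 0):
            open_start = t
    return result


print(common_intervals([[(0, 4), (5, 10)], [(0, 8)], [(3, 10)]]))
# [(3, 4), (5, 8)]
```

With M ranges in total, sorting the map keys costs O(M log M) and the sweep itself is linear in the number of keys.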
