Algorithm for distributing workload in a thread pool

Question

Let's imagine that we have T threads and we want to distribute a problem of size N among them. Every thread will choose a part of that problem to execute. Each thread will use its thread_id (a number from 0 to T-1), T and N to calculate the range of its sub-problem. Let's say that the range of the sub-problem is [S, E), where S and E belong to [0, N].

For example, let's say we have an array of 10 integers. We want to increase each element of that array by one, and we want to do that in parallel using 4 threads.

The 1st thread with thread_id==0 will use range [0, 3)
The 2nd thread with thread_id==1 will use range [3, 6)
The 3rd thread with thread_id==2 will use range [6, 8)
The 4th thread with thread_id==3 will use range [8, 10)

Does anyone know a fast algorithm that will calculate those ranges? Preferably without atomics or branches.

ciamej · Accepted Answer

If I understand correctly, you're looking for an equation like this?

S = floor(thread_id * N/T)
E = floor((thread_id + 1) * N/T)

If you multiply first (thread_id * N) and divide afterwards (/ T), you can do the whole computation in integers, and the floor function is not necessary, because integer division of non-negative values already rounds down.
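
As a rough sketch of the formula above in Python (the function and parameter names are made up for illustration), the first function multiplies before dividing and uses only integer arithmetic. For N = 10 and T = 4 it produces a slightly different split than the one listed in the question, but the ranges still cover [0, N) without gaps or overlap, and their sizes differ by at most one. The second function is a variant that is not part of the answer: it assumes you want the first N % T threads to take one extra element each, which reproduces the question's example exactly.

```python
def chunk_range(thread_id, num_threads, n):
    """Half-open range [S, E) from the multiply-then-divide formula."""
    # // is integer (floor) division, so no explicit floor() call is needed.
    start = thread_id * n // num_threads
    end = (thread_id + 1) * n // num_threads
    return start, end


def chunk_range_front_loaded(thread_id, num_threads, n):
    """Hypothetical variant (not from the answer): the first
    n % num_threads threads each get one extra element."""
    base, extra = divmod(n, num_threads)
    start = thread_id * base + min(thread_id, extra)
    end = (thread_id + 1) * base + min(thread_id + 1, extra)
    return start, end


if __name__ == "__main__":
    for tid in range(4):
        print(tid, chunk_range(tid, 4, 10), chunk_range_front_loaded(tid, 4, 10))
```

Neither version needs atomics or shared state, since each thread computes its own range from thread_id, T and N alone. In a language with fixed-width integers, the product thread_id * N can overflow for large N, so a wider integer type for the intermediate result is a reasonable precaution.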
