Reputation: 42390
I have some code which I am currently optimizing for concurrency in multicore architectures. In one of my classes, I found a nested foreach
loop. Basically the outer loop iterates through an array of NetworkInterface
objects. The inner loop iterates though the network interfaces IP addresses.
It got me thinking, is having Nested Parallel.ForEach
loops necessarily a good idea? After reading this article (Nested Parallel.ForEach Loops on the same list?) I am still unsure what applies where in terms of efficiency and parallel design. This example is taking about Parallel.Foreach
statements being applied to a list where both loops are performing operations on that list.
In my example, the loops are doing different things, so, should I:
Parallel.ForEach
loops?Parallel.ForEach
on the parent loop and leave the inner loop as-is?Upvotes: 27
Views: 32569
Reputation: 1711
I just came from a very hard time debugging nested Parallel.ForEach
within an ASP.NET application (.NET Framework 4.8.1).
After 3 days, I found evidence that nested Parallel.ForEach
statements will not obey the rule of running localInit
and localFinally
at the start/end of each thread (for each partition).
Somehow, it will share the same thread to process items of both the parent and the nested Parallel.ForEach
statements, running localInit
and localFinally
out of order, meaning: it will run localInit
once, start processing the parent items, then run localInit
again, within the same thread, to start processing the nested items. Then it will run localFinally
to finalize the processing of the parent or the nested loop and, finally, will run localFinally
again to finish.
This will cause major issues if you are dealing, for example, with shared ThreadStatic<T>
variables (which was my case).
So, my recommendation is: DO NOT nest Parallel.ForEach
statements unless you know exactly what you're doing.
Upvotes: 1
Reputation: 43921
My advice is to follow the second approach: Parallelize only the outer loop, and keep the inner loops sequential (for
/foreach
). Don't place Parallel.ForEach
loops the one inside the other. The reasons are:
The parallelization adds overhead. Each Parallel
loop has to synchronize the enumeration of the source
, start Task
s, watch cancellation/termination flags etc. By nesting Parallel
loops you are paying this cost multiple times.
Limiting the degree of parallelism becomes harder. The MaxDegreeOfParallelism
option is not an ambient property that affects child loops. It limits only a single loop. So if you have an outer Parallel
loop with MaxDegreeOfParallelism = 4
and an inner Parallel
loop also with MaxDegreeOfParallelism = 4
, the inner body
might be invoked concurrently 16 times (4 * 4
). It is still possible to enforce a sensible upper limit by configuring all loops with the same TaskScheduler
, and specifically with the ConcurrentScheduler
property of a shared ConcurrentExclusiveSchedulerPair
instance.
In case of an exception you'll get a deeply nested AggregateException
, that you'll have to Flatten
.
I would also suggest considering a third approach: do a single Parallel
loop on a flattened source sequence. For example instead of:
ParallelOptions options = new() { MaxDegreeOfParallelism = X };
Parallel.ForEach(NetworkInterface.GetAllNetworkInterfaces(), options, ni =>
{
foreach (UnicastIPAddressInformation ip in ni.GetIPProperties().UnicastAddresses)
{
// Do stuff with ni and ip
});
});
...you could do this:
var query = NetworkInterface.GetAllNetworkInterfaces()
.SelectMany(ni => ni.GetIPProperties().UnicastAddresses, (ni, ip) => (ni, ip));
Parallel.ForEach(query, options, pair =>
{
(ni, ip) = pair;
// Do stuff with ni and ip
});
This approach parallelizes only the Do stuff
. The calling of ni.GetIPProperties()
is not parallelized. The IP addresses are fetched sequentially, for one NetworkInterface
at a time. It also intensifies the parallelization of each NetworkInterface
, which might not be what you want (you might want to spread the parallelization among many NetworkInterface
s). So this approach has characteristics that make it compelling for some scenarios, and unsuitable for others.
One other case worth mentioning is when the objects in the outer and inner sequences are of the same type, and have a parent-child relationship. In that case check out this question: Parallel tree traversal in C#.
Upvotes: 1
Reputation: 33149
A Parallel.ForEach does not necessarily execute in parallel -- it is just a request to do so if possible. Therefore, if the execution environment does not have the CPU power to execute the loops in parallel, it will not do so.
If the actions on the loops are not related (i.e., if they are separate and do not influence each other), I see no problem using Parallel.ForEach both on inner and outer loops.
It really depends on the execution environment. You could do timing tests if your test environment is similar enough to the production environment, and then determine what to do. When in doubt, test ;-)
Good luck!
Upvotes: 34
Reputation: 4795
The answer will be, it depends;
Threads are not cheap, they take time to create, and memory to exist. If you're not doing something computationally expensive with those IP Addresses, and using the wrong type of collection for concurrent access, you're almost certainly slowing down your application.
Use StopWatch
to help you answer these questions.
Upvotes: 3