Kyle Newton

Reputation: 21

OpenMP: No Speedup in parallel workloads

I can't figure out what's going on with my fairly simple OpenMP-parallelized for loop. On the same input size, P=1 runs in ~50 seconds, but P=2 takes almost 300 seconds and P=4 runs in ~250 seconds.

Here's the parallelized loop:

double time = omp_get_wtime();

printf("Input Size: %d\n", n);

#pragma omp parallel for private(i) reduction(+:in)
for(i = 0; i < n; i++) {
    double x = (double)(rand() % 10000)/10000;
    double y = (double)(rand() % 10000)/10000;
    if(inCircle(x, y)) {
        in++;
    }
}

double ratio = (double)in/(double)n;
double est_pi = ratio * 4.0;
time = omp_get_wtime() - time;

Runtimes:

p=1, n=1073741824 - 52.764 seconds

p=2, n=1073741824 - 301.66 seconds

p=4, n=1073741824 - 274.784 seconds

p=8, n=1073741824 - 188.224 seconds

Running in an Ubuntu 20.04 VM with 8 cores of a Xeon 5650 and 16 GB of DDR3 ECC RAM, on top of a FreeNAS installation on a dual Xeon 5650 system with 70 GB of RAM.

Partial Solution:

The rand() call inside the loop causes the runtime to jump when running on multiple threads.

Upvotes: 1

Views: 157

Answers (1)

TrentP

Reputation: 4722

Since rand() uses state saved from the previous call to generate the next pseudo-random number, it can't safely run in multiple threads at the same time: multiple threads would be reading and writing the PRNG state concurrently.

POSIX states that rand() need not be thread-safe. This means your code might simply produce wrong results. Alternatively, the C library might wrap rand() in a mutex so that only one thread can call it at a time. That is what's happening here, and it slows the code down considerably: the threads spend nearly all their time contending for the rand() critical section, since nothing else they do takes any significant time.
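To see why the lock is unavoidable with shared state, here is a hypothetical sketch of what a thread-safe rand() might look like inside a C library (the names srand_locked/rand_locked and the LCG constants are illustrative, not glibc's actual implementation). Every thread must take the same mutex, so all calls are fully serialized:

```c
#include <pthread.h>

/* One global state word shared by all threads, guarded by a mutex. */
static unsigned long prng_state = 1;
static pthread_mutex_t prng_lock = PTHREAD_MUTEX_INITIALIZER;

void srand_locked(unsigned int seed)
{
    pthread_mutex_lock(&prng_lock);
    prng_state = seed;
    pthread_mutex_unlock(&prng_lock);
}

int rand_locked(void)
{
    pthread_mutex_lock(&prng_lock);   /* every thread contends here */
    /* Classic linear congruential generator step: the next value
     * depends entirely on the stored state, so the read-modify-write
     * must happen atomically. */
    prng_state = prng_state * 1103515245UL + 12345UL;
    int r = (int)((prng_state / 65536UL) % 32768UL);
    pthread_mutex_unlock(&prng_lock);
    return r;
}
```

With 8 threads hammering this in a tight loop, the lock traffic dominates, which matches the slowdown you measured.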

To solve this, try rand_r(), which has no shared state; instead, each caller passes a pointer to its own seed, which serves as the PRNG state.

Keep in mind that using the same seed for every thread will defeat the purpose of increasing the number of trials in your Monte Carlo simulation. Each thread would just use the exact same pseudo-random sequence. Try something like this:

unsigned int seed;
#pragma omp parallel private(seed)
{
    seed = omp_get_thread_num();
    #pragma omp for private(i) reduction(+:in)
    for(i = 0; i < n; i++) {
        double x = (double)(rand_r(&seed) % 10000)/10000;
        double y = (double)(rand_r(&seed) % 10000)/10000;
        if(inCircle(x, y)) {
            in++;
        }
    }
}

BTW, you might notice your estimate is off: x and y need to be uniformly distributed over [0, 1], and with % 10000 they are not.
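One common fix (a sketch, not the only option) is to divide the full rand_r() output by RAND_MAX + 1.0 rather than taking a small modulus. This gives full precision, avoids modulo bias (RAND_MAX + 1 is generally not a multiple of 10000), and maps onto [0, 1); for this estimator the missing endpoint is immaterial:

```c
#define _POSIX_C_SOURCE 200809L
#include <stdlib.h>

/* Map rand_r() output onto [0, 1): divide by RAND_MAX + 1.0 instead
 * of taking % 10000, which would keep only 10000 distinct values and
 * introduce a slight bias. */
double uniform01(unsigned int *seed)
{
    return (double)rand_r(seed) / ((double)RAND_MAX + 1.0);
}
```

In the loop above you would then write x = uniform01(&seed); and y = uniform01(&seed);.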

Upvotes: 3

Related Questions