OpenMP For Loop gets slow by increasing threads

Question

I have a simple for loop over an array. It gets slow when I am using more processors. Here is the code:

#include 
#include 
#include 
#include 
#include 

using namespace std;

int main(int argc, char* argv[])
{
    string nth;
    if(argc<2)
    {
         cout << "Not enough parameters have been passed. 
";
         cin.get();
         exit(0);
    }
    else
    {
       nth=argv[1];
    }

    N=1000;
    vector > I;
    int *array= new int[N];
    // Initialize I and array

    struct timeval time_start;
    gettimeofday(&time_start, NULL);
    for (int y=0; y



I compile it as:

g++ test.cpp -fopenmp -o outTestPar -std=c++0x


and run it by: 

./outTestPar 2


I run it on a machine with 64 cores. I get this as results:

With 2 processor: 

[...]$ ./outTestPar 2
Section Time: 28003
[...]$ ./outTestPar 2
Section Time: 20897
[...]$ ./outTestPar 2
Section Time: 19506
[...]$ ./outTestPar 2
Section Time: 22990


With 4 processor:

[...]$ ./outTestPar 4
Section Time: 20362
[...]$ ./outTestPar 4
Section Time: 19963
[...]$ ./outTestPar 4
Section Time: 28147
[...]$ ./outTestPar 4
Section Time: 20857


With 8 processor:

[...]$ ./outTestPar 8
Section Time: 24881
[...]$ ./outTestPar 8
Section Time: 28056


With 16 processor:

[...]$ ./outTestPar 16
Section Time: 24332
[...]$ ./outTestPar 16
Section Time: 26921


With 32 processor:

[...]$ ./outTestPar 32
Section Time: 21858
[...]$ ./outTestPar 32
Section Time: 23367
[...]$ ./outTestPar 32
Section Time: 25200
[...]$ ./outTestPar 32
Section Time: 24813


As you can see, not only is there no improvement, sometimes it gets worse. Any idea what's going on? 
How can I improve it? 
I also tried different schedules (static, dynamic, guided). Didn't work and made it worse.

OpenMP For Loop gets slow by increasing threads

Answers (1)

Related Questions