Merge Sort Time in C

Question

This is my virtual machine:

CPU: 4 cores
RAM: 4096 MB
Operating System: Ubuntu 18.04 (64-bit)

Problem1: Why there is a threshold 4193790?

I write a merge sort in C:

#include
#include
#include
#include
#include"helper.c"
#define N 4193789

void merge(double* li,int left1,int right1,int left2,int right2,int size){
    double *li_tmp;
    li_tmp = (double *)malloc(sizeof(double)*size);
    int i = left1;
    int j = left2;
    int k = left1;

    while(i<=right1 && j<=right2){ 
        if(li[i] < li[j]){
            li_tmp[k] = li[i++]; 
        }     
        else{
            li_tmp[k] = li[j++];
        } 
        k++; 
    }
    if(i>right1){
        while(j<=right2){
            li_tmp[k++] = li[j++];
        }
    }
    else if(j>right2){
        while(i<=right1){
            li_tmp[k++] = li[i++];
        }
    }

    for(i=left1; idata[i+1]){
            correct = false;
        }
    }
    if (correct){
        printf("Correct!
");
    }else{
        printf("Not correct!
");
    }

    printf("time spent=%12.10f
",delta);
}

This is my helper.c, just generate a random double array in double *a.

#include
#include
#include

void gen_rand(double *a, int num){
    for (int i=0;i


I found a very weird scenario:

I know that merge sort is O(nlogn), so when I experiment with different array lengths, I find that the time consumption varies a lot in one area, and it doesn't fit O(nlogn).After many attempts, I found a threshold.

When I define N as 4193789, the time is 1s, but when I change N to 4193790, the time will increase to 34s!

I wonder why there is such a threshold.
vagrant@hang2:~/data/$ gcc -fopenmp merge_sort_main.c 
vagrant@hang2:~/data/$ ./a.out 
Correct!
time spent=1.1103340000
vagrant@hang2:~/data/$ gcc -fopenmp merge_sort_main.c 
vagrant@hang2:~/data/$ ./a.out 
Correct!
time spent=34.5053590000


Problem2: Why the omp method get slower when a big array (more than 4193790)?
Another problem with omp:
This is my omp main :
#pragma omp parallel num_threads(4)
    {
    #pragma omp single 
        merge_sort_omp(data, 0, N-1, N);
    }

And merge_sort_omp():
void merge_sort_omp(double* li,int left,int right,int size){
    if (left10000){        
            int mid = (left + right)/2;
            #pragma omp task firstprivate (li, left, mid)
            merge_sort_omp(li,left,mid,size);
            #pragma omp task firstprivate (li, mid, right)
            merge_sort_omp(li,mid+1,right,size);
            #pragma omp taskwait
            merge(li,left,mid,mid+1,right,size);
        }else{
            int mid = (left + right)/2;
            merge_sort_omp(li,left,mid,size);
            merge_sort_omp(li,mid+1,right,size);
            merge(li,left,mid,mid+1,right,size);
        }
    }
}

I tried N=4000000 and N=4193790 as follows:
vagrant@hang2:~/data$ gcc -fopenmp merge_sort_main.c 
vagrant@hang2:~/data$ ./a.out 
Correct!
time spent=1.1358180000
vagrant@hang2:~/data$ gcc -fopenmp merge_sort_omp_main.c 
vagrant@hang2:~/data$ ./a.out 
Correct!
time spent=0.4998150000
vagrant@hang2:~/data$ gcc -fopenmp merge_sort_main.c 
vagrant@hang2:~/data$ ./a.out 
Correct!
time spent=34.3504340000
vagrant@hang2:~/data$ gcc -fopenmp merge_sort_omp_main.c 
vagrant@hang2:~/data$ ./a.out 
Correct!
time spent=111.9368700000


I want to know why the parallel code is twice as fast as the serial code at N= 4000000, but the serial code is slower at N=4193790. Almost three times slower. I want to know why the omp get slower?

Merge Sort Time in C

Problem1: Why there is a threshold 4193790?

Problem2: Why the omp method get slower when a big array (more than 4193790)?

Answers (1)

Related Questions