Why can't I use gpu to reduce the cpu occupancy rate?

Question

From cuda cpu function - gpu kernel overlap ,I know how to execute the gpu and cpu functions concurrently. But here is another situation, the gpu and cpu functions have to execute serially, the problem is when cpu is blocking by gpu kernel executing, would the cpu process suspend? If yes, the occupancy rate of cpu should be low, right?

Below is my cuda code, quite simple, just for test

#include "cuda_runtime.h"
#include "device_launch_parameters.h"
#include 

__global__ void kernel(float *d_data)
{
    //dead loop
    while(1)
    {
        *d_data = -1;
        *d_data = 1/(*d_data);
        *d_data = (*d_data) / (*d_data);
    }
}


int main()
{
    float *d_data;
    cudaMalloc(&d_data, sizeof(float));
    kernel << <1, 1 >> >(d_data);
    //cpu process would be blocking here
    float data;
    cudaMemcpy(&data, d_data, sizeof(int), cudaMemcpyDeviceToHost);
    printf("%f
",data);
    return 0;
}

Using top to check the occupancy rate of cpu is 100%

%Cpu10 : 75.1 us, 24.9 sy,  0.0 ni,  0.0 id,  0.0 wa,  0.0 hi,  0.0 si,  0.0 st

and I have confirmed that the cpu process I launch is running on Cpu10.

Am I missing something? I am very grateful for your help!

Why can't I use gpu to reduce the cpu occupancy rate?

Answers (1)

Related Questions

Why can&#39;t I use gpu to reduce the cpu occupancy rate?

Answers (1)

Related Questions

Why can't I use gpu to reduce the cpu occupancy rate?