Reputation: 2491
I would like to know whether it's possible to launch a CUDA kernel so that the grid/block size can be specified at run time, rather than at compile time as usual.
Any help regarding this would be greatly appreciated.
Upvotes: 2
Views: 2765
Reputation: 16816
In CUDA applications, it is rarely useful to hard-code the grid size. Most of the time the block size is fixed, while the grid size is computed at run time from the size of the input data. Consider the following example of vector addition.
__global__ void kernel(float* a, float* b, float* c, int length)
{
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    //Bounds check inside the kernel
    if (tid < length)
        c[tid] = a[tid] + b[tid];
}
int addVectors(float* a, float* b, float* c, int length)
{
    //a, b, c are already allocated on the device
    //Fix the block size to an appropriate value
    dim3 block(128);
    dim3 grid;
    grid.x = (length + block.x - 1) / block.x;
    //The grid size depends on the length of the vector.
    //The total number of threads is rounded up to the nearest multiple
    //of the block size, so there are at least as many threads as there
    //are vector elements.
    kernel<<<grid, block>>>(a, b, c, length);
    return 0;
}
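For completeness, here is a minimal host-side sketch of how addVectors might be driven; the main function, the command-line parsing, and the cudaMemcpy placeholder are illustrative assumptions, not part of the answer above. The point is that the vector length arrives only at run time, which is exactly why the grid size is computed rather than hard-coded:

#include <cstdlib>

int main(int argc, char** argv)
{
    //Hypothetical driver: the length is only known at run time
    int length = (argc > 1) ? atoi(argv[1]) : 1000000;
    size_t bytes = length * sizeof(float);

    float *d_a, *d_b, *d_c;
    cudaMalloc(&d_a, bytes);
    cudaMalloc(&d_b, bytes);
    cudaMalloc(&d_c, bytes);

    //... fill d_a and d_b with input data via cudaMemcpy ...

    addVectors(d_a, d_b, d_c, length);
    cudaDeviceSynchronize();

    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_c);
    return 0;
}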
Upvotes: 4
Reputation: 98118
CUDA kernels and device functions can use blockDim.{x,y,z} to access the block configuration, as well as gridDim.{x,y,z} to access the grid configuration. If you have a kernel or device function that can cope with various configurations, then all you need to do is launch the kernel (myKernel<<<dimGrid,dimBlock>>>) with whatever dimGrid and dimBlock you choose at run time. I don't think this is unusual at all.
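As an illustration, here is a minimal sketch of a launch whose configuration comes entirely from the command line; the kernel name printConfig and the argv handling are made up for this example (device-side printf requires compute capability 2.0 or newer):

#include <cstdio>
#include <cstdlib>

__global__ void printConfig()
{
    //Every thread can read the launch configuration at run time;
    //thread 0 of block 0 prints it once.
    if (blockIdx.x == 0 && threadIdx.x == 0)
        printf("gridDim.x = %d, blockDim.x = %d\n", gridDim.x, blockDim.x);
}

int main(int argc, char** argv)
{
    //Nothing about the launch configuration is fixed at compile time
    int numBlocks  = (argc > 1) ? atoi(argv[1]) : 4;
    int numThreads = (argc > 2) ? atoi(argv[2]) : 128;

    dim3 dimGrid(numBlocks);
    dim3 dimBlock(numThreads);

    printConfig<<<dimGrid, dimBlock>>>();
    cudaDeviceSynchronize();
    return 0;
}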
Upvotes: 3