error in cuda shared memory static allocation

Question

I wrote a CUDA code using shared memory like this:

__global__ void matrix_mul_shared(float *ad,float *bd,float *cd,int N)
{
    float pvalue=0;
    int TILE=blockDim.x;
    int ty=threadIdx.y;
    int tx=threadIdx.x;

    //allocate shared memory per block
    __shared__ float ads[1][1];
    __shared__ float bds[1][1];

 .

. . }

This code works , but the following code fails;

__global__ void matrix_mul_shared(float *ad,float *bd,float *cd,int N)
{
    float pvalue=0;
    int TILE=blockDim.x;
    int ty=threadIdx.y;
    int tx=threadIdx.x;

    //allocate shared memory per block
    __shared__ float ads[TILE][TILE];
    __shared__ float bds[TILE][TILE];

 .
. 
.
}

The compiler is expecting something constant at the lines where I am allocating shared memory. It says(I forgot the exact error but it is something like this):

The parameters should be a constant

I was able to use printf and print the value of TILE, and it is coming out 1. so why this error?

error in cuda shared memory static allocation

Answers (1)

Related Questions