user1192151


Using Shared & Constant Memory in CUDA

I want to read a text file and store it in an array, transfer the array from the host to the device, and keep it in shared memory. I have written the following code, but the execution time has increased compared with using global memory, and I cannot understand why. It would also be great if someone could help me write this code using constant memory.

__global__ void deviceFunction(char *pBuffer, int pSize){
    extern __shared__ char p[];
    int i;
    for(i = 0; i < pSize; i++){
        p[i] = pBuffer[i];
    }
}
int main(void){

    cudaMalloc((void**)&pBuffer_device, sizeof(char) * pSize);
    cudaMemcpy(pBuffer_device, pBuffer, sizeof(char) * pSize, cudaMemcpyHostToDevice);
    // extern __shared__ requires the dynamic shared-memory size as the
    // third launch parameter
    deviceFunction<<<BLOCK, THREAD, sizeof(char) * pSize>>>(pBuffer_device, pSize);

}

Upvotes: 0

Views: 1353

Answers (1)

djmj

Reputation: 5554

  1. Maybe because every thread in a block tries to write to the same shared memory addresses concurrently, over the whole range 0 to pSize!
    Use thread-collaborative loading of global memory data into shared memory: http://forums.nvidia.com/index.php?showtopic=216640&view=findpost&p=1332005
    Every thread in your kernel performs pSize global memory reads.
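A minimal sketch of what collaborative loading could look like for your kernel: each thread copies a strided subset of the buffer, so no two threads write the same shared memory address and each global element is read exactly once per block.

```cuda
// Sketch: collaborative loading of pBuffer into shared memory.
// Launch with the dynamic shared-memory size as the third parameter, e.g.
//   deviceFunction<<<BLOCK, THREAD, pSize * sizeof(char)>>>(pBuffer_device, pSize);
__global__ void deviceFunction(char *pBuffer, int pSize)
{
    extern __shared__ char p[];

    // Thread t copies elements t, t + blockDim.x, t + 2*blockDim.x, ...
    for (int i = threadIdx.x; i < pSize; i += blockDim.x) {
        p[i] = pBuffer[i];
    }
    __syncthreads();  // after this barrier, all of p[] is valid for every thread

    // ... work on p[] here ...
}
```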

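Regarding the constant-memory part of the question, here is a hedged sketch. Constant memory must be a fixed-size, file-scope array (limited to 64 KB total) and is filled with cudaMemcpyToSymbol rather than cudaMemcpy; MAX_SIZE and constantKernel are assumed names for illustration.

```cuda
// Sketch: the same data served from constant memory instead of shared memory.
#define MAX_SIZE 1024          // assumed compile-time bound; must be <= 64 KB

__constant__ char cBuffer[MAX_SIZE];

__global__ void constantKernel(char *out, int pSize)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < pSize)
        out[i] = cBuffer[i];   // reads go through the constant cache
}

// Host side: copy into the symbol before launching the kernel, e.g.
//   cudaMemcpyToSymbol(cBuffer, pBuffer, pSize * sizeof(char));
```

Note that constant memory only pays off when all threads in a warp read the same address in the same instruction; for the per-thread indexed access above, shared or global memory may perform just as well.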
Upvotes: 1
