How do I allocate device memory to my array of pointers, in CUDA?

Question

I have the following data structures on my host:

typedef struct point{
   int x;
   int y;
}Point;

 typedef struct pair{
     Point i;
     Point j;
     float cost;
 }Pair;

Pair* pairs[n];   // allocates an array of pointers to pair

Now, I've to copy "pairs" to the GPU. So, I declare the following pointer:

Pair **d_pair;

and allocate the memory using the following:

cudaMalloc((void**)d_pair,(sizeof(Pair)+sizeof(Pair*))*n);

Now, I copy from host to device:

cudaMempy(d_pair,pair,(sizeof(Pair)+sizeof(Pair*))*n),cudaMemcpyHostToDevice);

The kernel prototype receives d_pair as:

__global__ my_kernel(Pair* d_pair[], ... ){ 
...
}

Should the above sequence of statements work as intended? If not, what modifications I make? Basically, I want to copy Pair* pairs[n]; as such to "d_pair". How do I do this?

How do I allocate device memory to my array of pointers, in CUDA?

Answers (1)

Related Questions