Is this way of allocating a device object "correct"?

Question

SO I asked a question before about how to allocate an object on the device directly instead of the "normal":

Allocate on host
Copy to device
Copy dynamically allocated fields to device one by one

The main reason I want it to be allocated directly on the device is that I don't want to copy each dynamically allocated field inside one by one manually.

Anyway, so I think I have actually found a way to do this, and I would like to see some input from more experienced CUDA programmers (like Robert Crovella).

Let's see the code first:

class Particle
{
    public:
    int *data;

    __device__ Particle()
    {
        data = new int[10];
        for (int i=0; i<10; i++)
        {
            data[i] = i*2;
        }
    }
};


__global__ void test(Particle **result)
{
    Particle *p = new Particle();

    result[0] = p; // store memory location
}

__global__ void test2(Particle *p)
{
    for (int i=0; i<10; i++)
        printf("%d
", p->data[i]);

}

int main() {
    // initialise and allocate an object on device
    Particle **d_p_addr;
    cudaMalloc((void**)&d_p_addr, sizeof(Particle*));
    test<<<1,1>>>(d_p_addr);

    // copy pointer to host memory
    Particle  **p_addr = new Particle*[1];
    cudaMemcpy(p_addr, d_p_addr, sizeof(Particle*), cudaMemcpyDeviceToHost);

    // test:
    test2<<<1,1>>>(p_addr[0]);

    cudaDeviceSynchronize();

    printf("Done!
");

}

As you can see, what I do is:

Call a kernel that initialises an object on the device and stores its pointer an output parameter
Copy the pointer to the allocated object from device memory to host memory
Now you can pass that pointer to another kernel just fine !

This code actually works, but I'm not sure if there are drawbacks.

Cheers

EDIT: as pointed out by Robert, there was no point of creating a pointer on host first, so I removed that part from the code.

Is this way of allocating a device object "correct"?

Answers (1)

Related Questions

Is this way of allocating a device object &quot;correct&quot;?

Answers (1)

Related Questions

Is this way of allocating a device object "correct"?