Object Lifetime and cudaMemcpy

Question

I'm trying to transfer a buffer of Array objects to the device, where Array is defined as:

struct Array {
     float* const ptr;
     const size_t length;

     Array(float* const ptr, const size_t length) : ptr(ptr), length(length) {}
};

To construct a buffer of arrays in host code, I use placement new, because the class is not copy-assignable (its members are const).

Normally I would use cudaMemcpy as follows:

Array* arrays = (Array*) malloc(sizeof(Array) * 3);
new (arrays + 0) Array(nullptr, 0);
new (arrays + 1) Array(nullptr, 0);
new (arrays + 2) Array(nullptr, 0);

Array* device_arrays;
cudaMalloc(&device_arrays, sizeof(Array) * 3);

cudaMemcpy((void*) device_arrays, (void*) arrays, sizeof(Array) * 3, cudaMemcpyHostToDevice);
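
Since cudaMemcpy only moves raw bytes, the properties this pattern relies on can be checked at compile time. The following is a minimal host-only sketch, not device code: std::memcpy stands in for cudaMemcpy, and roundtrip_length is a hypothetical helper introduced purely for illustration. It shows the byte-copy mechanics only and does not by itself settle whether the copied objects count as constructed.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>
#include <new>
#include <type_traits>

struct Array {
    float* const ptr;
    const std::size_t length;

    Array(float* const ptr, const std::size_t length) : ptr(ptr), length(length) {}
};

// A byte-wise copy is only meaningful for trivially copyable types.
static_assert(std::is_trivially_copyable<Array>::value,
              "Array must be trivially copyable to be byte-copied");
// The const members delete copy assignment, hence the placement new below.
static_assert(!std::is_copy_assignable<Array>::value,
              "Array is expected not to be copy-assignable");

// Hypothetical helper: builds `count` Arrays with placement new, byte-copies
// them into a second buffer (std::memcpy is a stand-in for cudaMemcpy), and
// returns the length read back from the copy at `index`.
std::size_t roundtrip_length(std::size_t count, std::size_t index) {
    assert(index < count);

    Array* src = static_cast<Array*>(std::malloc(sizeof(Array) * count));
    Array* dst = static_cast<Array*>(std::malloc(sizeof(Array) * count));
    assert(src != nullptr && dst != nullptr);

    for (std::size_t i = 0; i < count; ++i) {
        new (src + i) Array(nullptr, i * 10);  // element i gets length i * 10
    }

    std::memcpy(dst, src, sizeof(Array) * count);
    const std::size_t result = dst[index].length;

    std::free(dst);
    std::free(src);
    return result;
}
```

For example, roundtrip_length(3, 2) reads back the length stored in the third element of the copied buffer.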

However, since I am now using const members and a constructor, it occurred to me that while the Array class is trivially copyable, it isn't getting "constructed" by cudaMemcpy. Is it valid to use the device_arrays pointer in a kernel, for example:

__global__ void foo(Array* device_arrays) {
     int l = device_arrays[0].length;
}

Or do I need to construct the Array object in device code? (If I need to construct it separately, it seems this would only be possible by transferring the ptr and length data in POD form and constructing the Array object in a kernel from that POD data. It does not seem like something that can be automated with a templated function.)
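
The "transfer POD data, then construct" alternative described above can be sketched as follows. This is a host-only illustration under stated assumptions: ArrayData and construct_each are hypothetical names, not part of CUDA or of the original code, and on the device the loop body would presumably live in a __global__ kernel with one thread per element. It is meant to show what a templated version might look like, not to establish that this step is required.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <new>

struct Array {
    float* const ptr;
    const std::size_t length;

    Array(float* const ptr, const std::size_t length) : ptr(ptr), length(length) {}
};

// Hypothetical POD mirror of Array: the plain data that would be transferred.
struct ArrayData {
    float* ptr;
    std::size_t length;
};

// Hypothetical generic helper: constructs `count` objects of type T in
// `storage` with placement new. `build` turns one POD record into a T.
template <typename T, typename Pod, typename Build>
T* construct_each(void* storage, const Pod* pods, std::size_t count, Build build) {
    assert(storage != nullptr);
    T* out = static_cast<T*>(storage);
    for (std::size_t i = 0; i < count; ++i) {
        new (out + i) T(build(pods[i]));  // copy-construct in place from the built value
    }
    return out;
}

// The conversion for this particular pair of types.
struct MakeArray {
    Array operator()(const ArrayData& d) const { return Array(d.ptr, d.length); }
};
```

A call would look like construct_each<Array>(storage, pods, 3, MakeArray()), where storage points to at least sizeof(Array) * 3 bytes of suitably aligned memory.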
