CUDA transfer memory during kernel execution

Question

I know that CUDA kernels can be "overlapped" by putting them into separate streams, but I'm wondering whether it would be possible to transfer memory during kernel execution. CUDA kernels are asynchronous, after all.

Roger Dahl · Accepted Answer

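Yes. Kernels, host-to-device transfers, and device-to-host transfers can all run concurrently, provided the device supports it. For a copy to overlap with a kernel, issue it with cudaMemcpyAsync from page-locked (pinned) host memory, and put the copy and the kernel in different non-default streams.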

http://developer.download.nvidia.com/CUDA/training/StreamsAndConcurrencyWebinar.pdf
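As a rough illustration of the answer above, here is a minimal sketch that splits a buffer into chunks and gives each chunk its own stream, so the copy for one chunk can overlap with the kernel working on another. The kernel `scale`, the helper `overlap_demo`, and the chunk count are hypothetical names and values chosen for this example, not something from the question. Whether the operations actually overlap depends on the device; the sketch only sets up the conditions that allow it.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: doubles each element in place.
__global__ void scale(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Hypothetical helper: returns the number of wrong elements (0 on success),
// or -1 if a CUDA call failed. Assumes n is divisible by kChunks.
int overlap_demo(int n) {
    const int kChunks = 4;  // assumed chunk count, one stream per chunk
    const int chunk = n / kChunks;
    const size_t chunkBytes = chunk * sizeof(float);

    float *h = nullptr, *d = nullptr;
    // Pinned (page-locked) host memory, so the async copies do not fall back
    // to staged, effectively synchronous transfers.
    if (cudaMallocHost(&h, n * sizeof(float)) != cudaSuccess) return -1;
    if (cudaMalloc(&d, n * sizeof(float)) != cudaSuccess) {
        cudaFreeHost(h);
        return -1;
    }
    for (int i = 0; i < n; ++i) h[i] = (float)i;

    cudaStream_t streams[kChunks];
    for (int s = 0; s < kChunks; ++s) cudaStreamCreate(&streams[s]);

    // Work within one stream runs in order; work in different streams may
    // overlap, e.g. chunk 1's upload with chunk 0's kernel.
    for (int s = 0; s < kChunks; ++s) {
        float *hp = h + s * chunk;
        float *dp = d + s * chunk;
        cudaMemcpyAsync(dp, hp, chunkBytes, cudaMemcpyHostToDevice, streams[s]);
        scale<<<(chunk + 255) / 256, 256, 0, streams[s]>>>(dp, chunk);
        cudaMemcpyAsync(hp, dp, chunkBytes, cudaMemcpyDeviceToHost, streams[s]);
    }

    // Wait for all streams, then verify the results on the host.
    int bad = 0;
    if (cudaDeviceSynchronize() != cudaSuccess) {
        bad = -1;
    } else {
        for (int i = 0; i < n; ++i)
            if (h[i] != 2.0f * i) ++bad;
    }

    for (int s = 0; s < kChunks; ++s) cudaStreamDestroy(streams[s]);
    cudaFree(d);
    cudaFreeHost(h);
    return bad;
}

int main() {
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, 0) != cudaSuccess) return 1;
    // asyncEngineCount > 0 means the device can copy while a kernel runs.
    printf("copy engines: %d\n", prop.asyncEngineCount);
    int bad = overlap_demo(1 << 20);
    printf("mismatches: %d\n", bad);
    return bad == 0 ? 0 : 1;
}
```

A profiler timeline (for example from Nsight Systems) is the usual way to check whether the copies and kernels really ran at the same time on a given GPU.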
