What is the clock measure by clock() and clock64() in CUDA?

Question

What is the clock measure by clock() and clock64() in CUDA ?

According to CUDA documentation the clock is 'per-multiprocessor counter'. According to my understanding this refers to Primary GPU clock (not the shader clock).

But when I measure clock counts and convert it to time values using primary GPU clock frequency, the results I get are twice large as the real values (I measure real values using the kernel execution time from host code using cuda events). This suggests clock() returns the shader clock frequency instead of the primary GPU clock.

How can I solve this confusion ?

EDIT : I calculated the primary GPU clock frequency by dividing the clock rate I get from cudaGetDeviceProperties by 2. As far as I understand the value given by cudaGetDeviceProperties is the shader clock frequency.

What is the clock measure by clock() and clock64() in CUDA?

Answers (1)

Related Questions