Reputation: 498
I get about 3-4x times difference in computation time of a same CUDA kernel compiled on two different machines. Both versions run on a same machine and GPU device. The direct conclusion explaining the difference is different compiler settings. Although there is no single perfect setting and the tuning should be customized depending on the kernel, I wonder if there is any clear guideline for helping to choose the right settings. I use Visual Studio 2010. Thank you.
Upvotes: 0
Views: 604
Reputation: 151809
-G
switch passed to the nvcc compiler will usually have a negative effect on GPU code performance.Upvotes: 3