weird CUDA kernel result on old display driver

Question

I'm writing a CUDA program that to be run on thousands of different GPUs, those machine would have different version of display driver installed, I cannot force them to update to the latest driver. Actually most code runs fine on those 'old' machine, but fails with some particular code:

Here's the problem:

#include 
#include 
#include 

__global__
void test()
{
    unsigned i = 64;
    unsigned j = 192;
    int k = 7;

    for(j = 1 << (k - 1); i &j; j >>= 1)
        i ^= j;
    i ^= j;

    printf("i,j,k: %d,%d,%d
", i,j,k);
    // i,j,k: 32,32, 7  (correct)
    // i,j,k: 0, 64, 7  (wrong)
}

int main() {
    cudaSetDeviceFlags(cudaDeviceScheduleBlockingSync);

    test<<<1,1>>>();
}

The code prints 32,32,7 as result on GPU with latest driver, which is the correct result. But on old driver(lower than CUDA 6.5) it prints 0,64,7 .

I'm looking for any workaround for this.

Envoronment:

Developing: Win7-32bit, VS2013, CUDA 6.5
Corrent Result on: WinXP-32bit(and Win7-32bit), GTX-650(latest driver)
Wrong Result on: WinXP-32bit + GTX-750-Ti(old driver), WinXP-32bit + GTX-750(old driver)

talonmies · Accepted Answer

There is no workaround. The runtime API is versioned and the minimum driver version requirement is non-negotiable.

Your only two choices are to develop using the lowest common denominator toolkit version that supports the driver being used, or switch to the driver API.

weird CUDA kernel result on old display driver

Answers (2)

Related Questions