Accelerator restriction: unsupported operation: RSQRTSS

Question

I have a simple nbody implementation code and try to compile it for launching on NVIDIA GPUs (Tesla K20m/Geforce GTX 650 Ti). I use the following compiler options:

-Minfo=all -acc -Minline -Mfpapprox -ta=tesla:cc35/nvidia

Everything works without -Mfpapprox, but when I use it, the compilation fails with the following output:

346, Accelerator restriction: unsupported operation: RSQRTSS

The 346 line writes as:

float rdistance=1.0f/sqrtf(drSquared);

where

float drSquared=dx*dx+dy*dy+dz*dz+softening;

and dx, dy, dz are float values. This line is inside the #pragma acc parallel loop independent for() construction. What is the problem with -Mfpapprox?

Accelerator restriction: unsupported operation: RSQRTSS

Answers (1)

Related Questions