John

Reputation: 3070

How to fix the error: calling a __host__ function ("std::max<double>") is not allowed in CUDA?

#include <algorithm>
#include <vector>
template <typename Dtype>
__global__ void R_D_CUT(const int n, Dtype* r, Dtype* d,
                        Dtype cur_r_max, Dtype cur_r_min,
                        Dtype cur_d_max, Dtype cur_d_min) {
  CUDA_KERNEL_LOOP(index, n) {
    r[index] = __min(cur_r_max, __max(r[index], cur_r_min));
    d[index] = __min(cur_d_max, __max(d[index], cur_d_min));
  }
}

The code above compiles and runs on Windows, but fails on Ubuntu because of the __min and __max functions. I tried to fix it by replacing __min with std::min<Dtype> and __max with std::max<Dtype>:

template <typename Dtype>
__global__ void R_D_CUT(const int n, Dtype* r, Dtype* d,
                        Dtype cur_r_max, Dtype cur_r_min,
                        Dtype cur_d_max, Dtype cur_d_min) {
  CUDA_KERNEL_LOOP(index, n) {
    r[index] = std::min<Dtype>(cur_r_max, std::max<Dtype>(r[index], cur_r_min));
    d[index] = std::min<Dtype>(cur_d_max, std::max<Dtype>(d[index], cur_d_min));
  }
}

However, when I recompile, I get the following errors:

_layer.cu(7): error: calling a __host__ function("std::min<float> ") from a __global__ function("caffe::R_D_CUT<float> ") is not allowed

_layer.cu(7): error: calling a __host__ function("std::max<float> ") from a __global__ function("caffe::R_D_CUT<float> ") is not allowed

_layer_layer.cu(8): error: calling a __host__ function("std::min<float> ") from a __global__ function("caffe::R_D_CUT<float> ") is not allowed

_layer_layer.cu(8): error: calling a __host__ function("std::max<float> ") from a __global__ function("caffe::R_D_CUT<float> ") is not allowed

_layer_layer.cu(7): error: calling a __host__ function("std::min<double> ") from a __global__ function("caffe::R_D_CUT<double> ") is not allowed

_layer_layer.cu(7): error: calling a __host__ function("std::max<double> ") from a __global__ function("caffe::R_D_CUT<double> ") is not allowed

_layer_layer.cu(8): error: calling a __host__ function("std::min<double> ") from a __global__ function("caffe::R_D_CUT<double> ") is not allowed

_layer_layer.cu(8): error: calling a __host__ function("std::max<double> ") from a __global__ function("caffe::R_D_CUT<double> ") is not allowed

Could you help me fix it? Thanks.

Upvotes: 4

Views: 3451

Answers (2)

einpoklum

Reputation: 131646

Adding to @RobertCrovella's answer: If you want something which behaves more like std::max, you can use this templated wrapper over CUDA's math library:

#define __df__ __device__ __forceinline__
template <typename T> __df__ T maximum(T x, T y);
template <> __df__ int                 maximum<int               >(int x, int y)                               { return max(x,y);    }
template <> __df__ unsigned int        maximum<unsigned          >(unsigned int x, unsigned int y)             { return umax(x,y);   }
template <> __df__ long                maximum<long              >(long x, long y)                             { return llmax(x,y);  }
template <> __df__ unsigned long       maximum<unsigned long     >(unsigned long x, unsigned long y)           { return ullmax(x,y); }
template <> __df__ long long           maximum<long long         >(long long x, long long y)                   { return llmax(x,y);  }
template <> __df__ unsigned long long  maximum<unsigned long long>(unsigned long long x, unsigned long long y) { return ullmax(x,y); }
template <> __df__ float               maximum<float             >(float x, float y)                           { return fmaxf(x,y);  }
template <> __df__ double              maximum<double            >(double x, double y)                         { return fmax(x,y);   }
#undef __df__

(see here for a more complete set of these wrappers.)

Upvotes: 2

Robert Crovella

Reputation: 151944

Generally speaking, functionality associated with std:: is not available in CUDA device code (__global__ or __device__ functions).

Instead, for many math functions, NVIDIA provides a CUDA math library.

For this case, as @njuffa points out, CUDA provides templated/overloaded versions of min and max. So you should just be able to use min() or max() in device code, assuming the type usage corresponds to one of the available templated/overloaded types. Also, you should:

#include <math.h>

Here is a simple worked example showing usage of min() for both float and double type:

$ cat t381.cu
#include <math.h>
#include <stdio.h>

template <typename T>
__global__ void mymin(T d1, T d2){

  printf("min is :%f\n", min(d1,d2));
}


int main(){

  mymin<<<1,1>>>(1.0, 2.0);
  mymin<<<1,1>>>(3.0f, 4.0f);
  cudaDeviceSynchronize();
}
$ nvcc -arch=sm_52 -o t381 t381.cu
$ ./t381
min is :1.000000
min is :3.000000
$

Note that the available overloaded options even include some integer types.

Upvotes: 8
