caffe cudnn version 4 & 5

Question

I use cudnn acceleration in my caffe program. I use cudnn 4 at the begin and it's working fine but when I updated the cudnn to version 5.0, the pow function doesn't work. The calling function is in the batch_norm layer as

caffe_gpu_powx(variance_.count(), variance_.gpu_data(), Dtype(0.5), variance_.mutable_gpu_data());

And the data after calling doesn't change. The pow function is defined as below, as same as in the caffe github banch

template 
\__global__ void powx_kernel(const int n, const Dtype* a,
    const Dtype alpha, Dtype* y)
 {
     CUDA_KERNEL_LOOP(index, n)
     {           
         y[index] = pow(a[index], alpha);  
     }  
}

template <>
void caffe_gpu_powx(const int N, const float* a,
    const float alpha, float* y) {
    // NOLINT_NEXT_LINE(whitespace/operators)
    powx_kernel<<>>(
      N, a, alpha, y);
}

caffe cudnn version 4 & 5

Answers (1)

Related Questions

caffe cudnn version 4 &amp; 5

Answers (1)

Related Questions

caffe cudnn version 4 & 5