Member function of a C++ object as a CUDA __global__ function

Question

I have a base class:

  template <class T>
  class A {
  public:
      // some data
      T data;
      // some functions like constructors etc.
      ...
      // one virtual function
      virtual void evaluate() = 0;
  };

and a derived class:

  template <class T>
  class B : public A<T> {
  public:
      // some functions like constructors etc.
      virtual void evaluate();
      __global__ void function2();   // **** error message
  };

Also, I have

  template <class T>
  void B<T>::evaluate()
  {
      dim3 grid(1);
      dim3 block(1);
      function2<<<grid, block>>>();
  }

and

  template <class T>
  __global__ void B<T>::function2() // **** error message
  {
      // computation here
  }

So essentially I have a member function of a derived class which I would like to execute in parallel on the device.

Unfortunately, I get the error

  error : illegal combination of memory qualifiers

on these lines:

  1> __global__ void function2();   // **** error message

  2> template <class T> __global__ void B<T>::function2() // **** error message

I am new to CUDA. It would be very kind if someone could point out my error. I am developing on Visual Studio 2010.

talonmies · Accepted Answer

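The definition of class B is illegal because it declares a __global__ function (a CUDA kernel) as a class member. As per the language documentation, static member functions cannot be __global__ functions, and the compiler rejects non-static member kernels as well, which is what the "illegal combination of memory qualifiers" error is reporting. The out-of-class definition of B<T>::function2 is illegal for the same reason. A kernel has to be a free (non-member) function; a host member function such as evaluate() can still launch it.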
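One common way around the restriction is to move the kernel out of the class and have the member function launch it. The following is only a minimal sketch under some assumptions: the kernel name `function2_kernel`, the constructors, the placeholder computation (doubling `data`), and the copy of `data` to and from the device are all hypothetical and not part of the original code. Error checking of the CUDA runtime calls is left out for brevity.

```cuda
#include <cuda_runtime.h>

// Free (non-member) kernel template. It cannot use `this`, so the data
// it works on is passed in explicitly as a device pointer.
// The name function2_kernel is hypothetical.
template <class T>
__global__ void function2_kernel(T *value)
{
    // computation here (placeholder assumption: double the value)
    *value = *value * T(2);
}

template <class T>
class A {
public:
    T data;
    explicit A(T d) : data(d) {}   // hypothetical constructor
    virtual ~A() {}
    virtual void evaluate() = 0;
};

template <class T>
class B : public A<T> {
public:
    explicit B(T d) : A<T>(d) {}   // hypothetical constructor
    virtual void evaluate();       // ordinary host member function
};

// Host-side member function: copies the member to the device, launches
// the free kernel, and copies the result back into the object.
template <class T>
void B<T>::evaluate()
{
    T *d_value = 0;
    cudaMalloc((void **)&d_value, sizeof(T));
    cudaMemcpy(d_value, &this->data, sizeof(T), cudaMemcpyHostToDevice);

    dim3 grid(1);
    dim3 block(1);
    function2_kernel<T><<<grid, block>>>(d_value);

    // cudaMemcpy on the default stream waits for the kernel to finish.
    cudaMemcpy(&this->data, d_value, sizeof(T), cudaMemcpyDeviceToHost);
    cudaFree(d_value);
}
```

With this layout, something like `B<float> b(2.0f); b.evaluate();` on a machine with a working CUDA device should leave `b.data` equal to `4.0f`. The class keeps its virtual interface on the host, and only plain data crosses over to the device.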
