DerJFK

Reputation: 321

Problem with summing entries using the elementwise kernel in cupy

In the first code example (kernel_conv), I programmed a simple convolution and it worked with the expected result [1,1,2,1,1].

Then I used the elementwise kernel to sum all entries of a vector. However, when I run the second example (kernel_sum), I get the result [3,0,0] but would expect [6,0,0].

What is the difference between these two examples? Why is the variable y updated correctly in the first example, while in the second it seems to be overwritten?

import numpy as np 
import cupy as cp 

kernel_conv = cp.ElementwiseKernel(
    'raw float32 x', 'raw float32 y',
    ''' int idx = i*2 + 1;
        for(size_t j=0;j<3;j++){
          y[idx - 1 + j] += x[j];
        }
    ''', 'conv')

x = cp.asarray(np.array([1,1,1]),dtype=np.float32)
y = cp.zeros((5,),dtype=np.float32)
z = kernel_conv(x,y,size=2)
print(z)

kernel_sum = cp.ElementwiseKernel(
  'raw float32 x', 'raw float32 y',
  ''' 
      y[0] += x[i];
  ''', 'summe')

x = cp.asarray(np.array([1, 2, 3]), dtype=np.float32)
y = cp.zeros((3,),dtype=np.float32)
z = kernel_sum(x,y,size=3)
print(z)

Upvotes: 0

Views: 842

Answers (1)

Seiya Tokui

Reputation: 341

The incorrect result of kernel_sum is due to a data race: three threads try to write to the same global-memory address (y[0]) at the same time. To avoid the data race, you can either 1) use atomicAdd or 2) use cupy.ReductionKernel for the reduction, as sketched below.
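
For illustration, a minimal sketch of both fixes (the kernel names summe_atomic and summe_reduce are arbitrary, and the ReductionKernel arguments follow the map / reduce / post-map / identity order from the CuPy documentation):

import numpy as np
import cupy as cp

# 1) atomicAdd serializes the conflicting updates to y[0].
kernel_sum_atomic = cp.ElementwiseKernel(
    'raw float32 x', 'raw float32 y',
    '''
        atomicAdd(&y[0], x[i]);
    ''', 'summe_atomic')

x = cp.asarray(np.array([1, 2, 3]), dtype=np.float32)
y = cp.zeros((3,), dtype=np.float32)
print(kernel_sum_atomic(x, y, size=3))  # expected [6. 0. 0.]

# 2) cupy.ReductionKernel lets CuPy organize the reduction itself.
kernel_sum_reduce = cp.ReductionKernel(
    'float32 x',    # input params
    'float32 y',    # output params
    'x',            # map: value fed into the reduction
    'a + b',        # reduce: how two partial results are combined
    'y = a',        # post-map: write the final result
    '0',            # identity element
    'summe_reduce')

print(kernel_sum_reduce(x))  # expected 6.0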

Actually, kernel_conv also has a data race: the first thread running y[2] += x[2] may conflict with the second thread running y[2] += x[0]. Because the first thread happened to lag slightly in the actual execution, the result was not affected, but this is a matter of timing and is not guaranteed in general*. To fix it, you can again use atomicAdd, or you can change how the computation is split across threads (e.g. launching 5 threads, each of which computes a distinct element of y, as sketched below).
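
A minimal sketch of that gather-style approach (the name conv_gather and the hard-coded sizes are just illustrative; each thread writes only its own y[i], so no two threads touch the same element):

import numpy as np
import cupy as cp

kernel_conv_gather = cp.ElementwiseKernel(
    'raw float32 x', 'raw float32 y',
    '''
        // Thread i owns y[i] and gathers every x[j] that the original
        // scatter kernel would have added into it (all m with 2*m + j == i).
        float acc = 0;
        for (int m = 0; m < 2; m++) {
            int j = i - 2 * m;
            if (j >= 0 && j < 3) {
                acc += x[j];
            }
        }
        y[i] = acc;
    ''', 'conv_gather')

x = cp.asarray(np.array([1, 1, 1]), dtype=np.float32)
y = cp.zeros((5,), dtype=np.float32)
print(kernel_conv_gather(x, y, size=5))  # expected [1. 1. 2. 1. 1.]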

* Indeed, in the above example of kernel_conv, I guess the correctness IS guaranteed when all the threads are running in the same warp, i.e., the number of threads is not larger than 32. This is because all the threads in the same warp run synchronously until they diverge due to control flow. If the number of threads is set to a larger value, a data race may happen at a warp boundary.

Upvotes: 1
