Performing im2Col with numpy.lib.stride_tricks.as_strided

Question

Consider a 4d array:

x = [[[[ 1  2  3  4]
       [ 5  6  7  8]
       [ 9 10 11 12]
       [13 14 15 16]]
    
      [[17 18 19 20]
       [21 22 23 24]
       [25 26 27 28]
       [29 30 31 32]]]]

I want to slide a (2x2) filter over both of these channels, flatten each window, and concatenate the results. The expected output is:

[[[ 1.  2.  3.  5.  6.  7.  9. 10. 11.]
  [ 2.  3.  4.  6.  7.  8. 10. 11. 12.]
  [ 5.  6.  7.  9. 10. 11. 13. 14. 15.]
  [ 6.  7.  8. 10. 11. 12. 14. 15. 16.]

  [17. 18. 19. 21. 22. 23. 25. 26. 27.]
  [18. 19. 20. 22. 23. 24. 26. 27. 28.]
  [21. 22. 23. 25. 26. 27. 29. 30. 31.]
  [22. 23. 24. 26. 27. 28. 30. 31. 32.]]]

Although I have shown these channels separately, they are part of the same array. My question is: is this achievable solely through the strides utility as_strided? Here's what I have tried.

import numpy as np

def getWindows(input: np.ndarray, kernel_size: int):
    batch_str, channel_str, kern_h_str, kern_w_str = input.strides
    out_size =  input.shape[2] - kernel_size + 1
    return (
        np.lib.stride_tricks.as_strided(
            input,
            (input.shape[0], input.shape[1], out_size, out_size, kernel_size, kernel_size),
            (batch_str, channel_str, kern_h_str, kern_w_str, kern_h_str, kern_w_str)
        )
        .reshape(
            input.shape[0], 
            input.shape[1], 
            out_size * out_size, 
            kernel_size * kernel_size
        )
        .swapaxes(2, 3)
        .reshape(
            input.shape[0], 
            input.shape[1] * kernel_size * kernel_size,
            out_size * out_size
        )
    )

x = np.arange(1, 33).reshape(1, 2, 4, 4)
print(x)
print(getWindows(x, kernel_size=2))

Although this solution works, it's extremely slow and ugly.
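
One possible simplification, offered as a sketch rather than a definitive answer: if the kernel-offset axes are placed before the window-position axes in the as_strided call, the 6-D view is already in the target order, so the swapaxes and the intermediate reshape are not needed. As far as I can tell, the final merge of axes still cannot be expressed with strides alone, because the windows overlap, so one reshape (which copies) remains. The name im2col_strided is hypothetical, and the sketch assumes stride 1, no padding, and a square kernel.

```python
import numpy as np
from numpy.lib.stride_tricks import as_strided


def im2col_strided(x: np.ndarray, kernel_size: int) -> np.ndarray:
    """Hypothetical helper: im2col for stride 1, no padding, square kernel."""
    n, c, h, w = x.shape
    s_n, s_c, s_h, s_w = x.strides
    out_h = h - kernel_size + 1
    out_w = w - kernel_size + 1

    # windows[n, c, ki, kj, i, j] == x[n, c, ki + i, kj + j]
    # Kernel offsets (ki, kj) come before window positions (i, j),
    # so the view is already laid out in im2col order.
    windows = as_strided(
        x,
        shape=(n, c, kernel_size, kernel_size, out_h, out_w),
        strides=(s_n, s_c, s_h, s_w, s_h, s_w),
        writeable=False,  # the view aliases x many times; keep it read-only
    )

    # Merging (c, ki, kj) and (i, j) needs a copy because the windows overlap.
    return windows.reshape(n, c * kernel_size * kernel_size, out_h * out_w)


x = np.arange(1, 33).reshape(1, 2, 4, 4)
cols = im2col_strided(x, kernel_size=2)
print(cols.shape)  # (1, 8, 9)
print(cols[0, 0])  # [ 1  2  3  5  6  7  9 10 11]
```

Each row corresponds to one (channel, kernel offset) pair and each column to one window position, which matches the expected output above. Whether this is meaningfully faster would need to be measured on realistic input sizes.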
