removing entries from a numpy array

Question

I have a multidimensional numpy array with the shape (4, 2000). Each column in the array is a 4D element where the first two elements represent 2D positions.

Now, I have an image mask with the same shape as an image which is binary and tells me which pixels are valid or invalid. An entry of 0 in the mask highlights pixels that are invalid.

Now, I would like to do is filter my first array based on this mask i.e. remove entries where the position elements in my first array correspond to invalid pixels in the image. This can be done by looking up the corresponding entries in the mask and marking those columns to be deleted which correspond to a 0 entry in the mask.

So, something like:

import numpy as np
# Let mask be a 2D array of 0 and 1s

array = np.random.rand(4, 2000)

for i in range(2000):
    current = array[:, i]
    if mask[current[0], current[1]] <= 0:
        # Somehow remove this entry from my array.

If possible, I would like to do this without looping as I have in my incomplete code.

unutbu · Accepted Answer

You could select the x and y coordinates from array like this:

xarr, yarr = array[0, :], array[1, :]

Then form a boolean array of shape (2000,) which is True wherever the mask is 1:

idx = mask[xarr, yarr].astype(bool)

mask[xarr, yarr] is using so-called "integer array indexing". All it means here is that the ith element of idx equals mask[xarr[i], yarr[i]].

Then select those columns from array:

result = array[:, idx]

import numpy as np

mask = np.random.randint(2, size=(500,500))
array = np.random.randint(500, size=(4, 2000))

xarr, yarr = array[0, :], array[1, :]
idx = mask[xarr, yarr].astype(bool)
result = array[:, idx]

cols = []
for i in range(2000):
    current = array[:, i]
    if mask[current[0], current[1]] > 0:
        cols.append(i)
expected = array[:, cols]

assert np.allclose(result, expected)

removing entries from a numpy array

Answers (2)

Related Questions