Numpy: Pick elements based on bool array

Question

I've got an array and a boolean array (as one hot encoding)

a = np.arange(12).reshape(4,3)
b = np.array([
    [1,0,0],
    [0,1,0],
    [0,0,1],
    [0,0,1],
], dtype=bool)

print(a)
print(b)
# [[ 0  1  2]
#  [ 3  4  5]
#  [ 6  7  8]
#  [ 9 10 11]]
# [[ True False False]
#  [False  True False]
#  [False False  True]
#  [False False  True]]

And I would like to pick elements using a boolean array

print(a[:, [True, False, False]])
# array([[0],
#        [3],
#        [6],
#        [9]])

print(a[:, [False, True, False]])
# array([[ 1],
#        [ 4],
#        [ 7],
#        [10]])

But this picks based on the same template boolean for all rows. I would like to perform this on a per row basis:

print(a[:, b])
# IndexError: too many indices for array

What should I put in ... so I get:

print(a[:, ...])
# array([[0],
#        [4],
#        [8],
#        [11]])

EDIT: This is analogous to what was used in the infamous CS231 course:

dscores = a
num_examples = 4 
# They had 300
y = b
dscores[range(num_examples),y]
# equivalent to
# a{:,b]

EDIT 2: In CS231 example, y is one dimensional and is not one hot encoded!

They were doing dscores[[rowIdx],[columnIdx]]

BENY · Accepted Answer

After filter by b broadcast it

a[b][:,None]
Out[168]: 
array([[ 0],
       [ 4],
       [ 8],
       [11]])

Or

a[b,None]
Out[174]: 
array([[ 0],
       [ 4],
       [ 8],
       [11]])

Numpy: Pick elements based on bool array

Answers (2)

Related Questions