Taking a Different Subset of Indices per Row in Numpy Using Fancy Indexing

Question

I have an array of pairwise differences of features:

diff.shape = (200, 200, 2)

of which I am trying to take only the columns corresponding to the 50 closest points. For each row, I have the indices of the closest 50 points stored as:

dist_idx.shape = (200, 50).

How can I index the 50 closest entries (different indices per row) using fancy indexing? I have tried:

diff[dist_idx].shape = (200, 50, 200, 2)
diff[np.arange(200), dist_idx] -> IndexError: shape mismatch: indexing arrays could not be broadcast together with shapes (200,) (200,50) 
diff[np.arange(200), dist_idx[np.arange(200)]] -> IndexError: shape mismatch: indexing arrays could not be broadcast together with shapes (200,) (200,50)

An iterative solution that works is:

X_diff = np.zeros((200, 50, 2))
for i in range(200):
    X_diff[i] = diff[i, dist_idx[i]]

Taking a Different Subset of Indices per Row in Numpy Using Fancy Indexing

Answers (1)

Related Questions