numpy: find first index of value in each row of 2D array

Question

How can I find the first index of a value in each row of a 2D array, using vectorized numpy functions?

For example, given

I = numpy.array([1,1,1])
M = numpy.array([[1,2,3],[2,3,1],[3,1,2]])

The output should be:

array([0, 2, 1])

I can do it with a list comprehension like this:

[ numpy.where(M[i] == I[i])[0][0] for i in range(0, len(I)) ]

What would the numpy equivalent be?

eickenberg · Accepted Answer

One way to exploit vectorization is as follows:

import numpy as np

mask = I[:, np.newaxis] == M    # True where M[r, c] == I[r]
coords = (mask * np.arange(M.shape[1], 0, -1)[np.newaxis, :]).argmax(1)
found = mask.any(1)             # rows that contain their value at all
coords = coords[found]

It disambiguates between several occurrences of the value of interest in the same row by multiplying the match mask by a decreasing counter, so that the first occurrence gets the highest value and is the one argmax picks. If a given row does not contain the indicated value, it is removed from coords. The remaining entries correspond, in order, to the rows in which the value was found; found is the boolean mask that selects those rows.
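To make the effect of the decreasing counter concrete, here is a minimal sketch that breaks the one-liner into steps. The input is hypothetical and differs from the question's: row 0 contains the value twice and row 2 does not contain it. The names mask, weights, weighted, found and rows are illustrative, not from the answer.

```python
import numpy as np

# Hypothetical data: row 0 has the target twice, row 2 does not contain it.
M = np.array([[1, 5, 1],
              [2, 3, 1],
              [3, 4, 2]])
I = np.array([1, 1, 1])

mask = I[:, np.newaxis] == M              # True where M[r, c] == I[r]
weights = np.arange(M.shape[1], 0, -1)    # [3, 2, 1]: earlier columns weigh more
weighted = mask * weights                 # matches keep their weight, non-matches become 0
coords = weighted.argmax(axis=1)          # column of the largest weight, i.e. the first match
found = mask.any(axis=1)                  # rows that contain their target at all

first_cols = coords[found]                # first-match column for each row with a match
rows = np.flatnonzero(found)              # the row numbers those columns belong to

# Variant (not from the answer): argmax on the boolean mask alone also gives the
# first match, because argmax returns the first occurrence of the maximum.
first_cols_alt = mask.argmax(axis=1)[found]

print(rows, first_cols)
```

Filtering with the any-mask matters because argmax returns 0 for a row with no match, which would otherwise be indistinguishable from a match in column 0.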
