Python: Filtering numpy values based on certain columns

Question

I'm trying to create a method for evaluating co-ordinates for a project that's due in about a week.

Assuming that I'm working in a 3D cartesian co-ordinate system - whose values are stored as rows in a numpy array. I am trying to read if 'z' (n[i, 2]) values exist given the corresponding, predetermined 'x' (n[i,0]) and 'y' (n[i,1]) values.

In the case where the values that are assigned are scalars, I am content to think that:

# Given that n is some numpy array
x, y = 2,3 
out = []
for i in range(0,n.shape[0]):
 if n[i, 0] == x and n[i,1] == y:
  out.append(n[i,2])

However, where the sorrow comes in is having to check if the values in another numpy array are in the original numpy array 'n'.

# Given that n is the numpy array that is to be searched
# Given that x contains the 'search elements'
out = []
for i in range(0,n.shape[0]):
 for j in range(0, x.shape[0]):
  if n[i, 0] == x[j,0] and n[i,1] == x[j,1]:
   out.append(n[i,2])

The issue with doing it this way is that the 'n' matrix in my application may well be in excess of 100 000 lines long.

Is there a more efficient way of performing this function?

today · Accepted Answer

This might be more efficient than nested loops:

out = []
for row in x:
    idx = np.equal(n[:,:2], row).all(1)
    out.extend(n[idx,2].tolist())

Note this assumes that x is of shape (?, 2). Otherwise, if it has more than two columns, just change row to row[:2] in the loop body.

Python: Filtering numpy values based on certain columns

Answers (2)

Related Questions