Pairs of distances squared - numpy - python

Question

I am reading a text (see the K-nearest neighbors example) which gives this line of code:

   dist_sq = np.sum((X[:,np.newaxis,:] - X[np.newaxis,:,:]) ** 2, axis=-1)

Here X is a numpy 10x2 array which represents 10 points in the 2D plane.
It was initialized like this:

X = np.random.rand(10, 2)

The text claims this line computes the pairwise squared distances between the points.
I have no idea why this works, or whether it works at all. I tried to understand it but I just can't. I personally try to avoid such cryptic code; it is just not human-readable, IMHO. The text explains this code in some detail, but I don't follow that explanation either.
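
One way to check whether the one-liner does what the text claims is to split it into named steps and compare the result against plain nested loops. The sketch below assumes a seeded generator in place of np.random.rand (only so the run is reproducible), and the intermediate names A, B, diff and loop are made up for illustration; they do not come from the text.

```python
import numpy as np

# Assumption: a seeded generator replaces np.random.rand for reproducibility.
rng = np.random.default_rng(0)
X = rng.random((10, 2))          # 10 points in the 2D plane

# Step 1: insert a length-1 axis so the two copies of X can broadcast.
A = X[:, np.newaxis, :]          # shape (10, 1, 2)
B = X[np.newaxis, :, :]          # shape (1, 10, 2)

# Step 2: broadcasting stretches both to (10, 10, 2);
# diff[i, j] holds the coordinate differences X[i] - X[j].
diff = A - B

# Step 3: square each coordinate difference, then add the two
# coordinates (the last axis) to get one number per pair (i, j).
dist_sq = np.sum(diff ** 2, axis=-1)   # shape (10, 10)

# The same computation written with explicit loops.
n = X.shape[0]
loop = np.empty((n, n))
for i in range(n):
    for j in range(n):
        dx = X[i, 0] - X[j, 0]
        dy = X[i, 1] - X[j, 1]
        loop[i, j] = dx * dx + dy * dy

print(np.allclose(dist_sq, loop))
```

If the two results agree, then dist_sq[i, j] is the squared Euclidean distance between point i and point j, with zeros on the diagonal.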

Also, axis=-1 adds to the confusion.
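
For reference, axis=-1 simply means "the last axis", following the usual Python negative-index convention. A small sketch on a toy array (the name toy and its shape are chosen here purely for illustration) shows that, for a 3D array, it is the same as axis=2:

```python
import numpy as np

# Toy 3D array with shape (2, 3, 4); the values are just 0..23.
toy = np.arange(24).reshape(2, 3, 4)

last = toy.sum(axis=-1)   # sum over the last axis (length 4)
same = toy.sum(axis=2)    # identical for a 3D array

print(last.shape)                   # (2, 3)
print(np.array_equal(last, same))   # True
```

In the line from the question, the last axis has length 2 (the x and y coordinates), so summing over it collapses each pair's two squared differences into a single number.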

Could someone decrypt this line of code?

Also, what is the point of writing X[:,np.newaxis,:] and X[np.newaxis,:,:]?

Aren't X[:,np.newaxis] and X[np.newaxis,:] enough? Don't they do the same thing?
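
A quick way to test this is to compare the long and short spellings directly. In NumPy indexing, omitted trailing axes are treated as full slices, so the shorter forms would be expected to give the same arrays. The variable names below are hypothetical, and a seeded generator is assumed so the run is reproducible:

```python
import numpy as np

# Assumption: seeded generator, used only for reproducibility.
X = np.random.default_rng(0).random((10, 2))

a_long = X[:, np.newaxis, :]    # explicit trailing slice
a_short = X[:, np.newaxis]      # trailing slice left implicit

b_long = X[np.newaxis, :, :]
b_short = X[np.newaxis, :]
b_shortest = X[np.newaxis]      # both trailing slices left implicit

print(a_long.shape, a_short.shape)                      # (10, 1, 2) (10, 1, 2)
print(b_long.shape, b_short.shape, b_shortest.shape)    # (1, 10, 2) (1, 10, 2) (1, 10, 2)
print(np.array_equal(a_long, a_short))                  # True
print(np.array_equal(b_long, b_short))                  # True
```

So the extra colons appear to be there for readability: they make it visible that the array has three axes and show where the new one was inserted.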

Also, from combinatorics, the number of squared distances should be 10*9/2, or 10*10/2 if we include the pairs of equal points, which have distance 0. But this dist_sq is a 10x10x2 array, which adds to the confusion. Why 200 elements?
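
Printing the shapes may help locate where the 200 elements come from. In the sketch below (seeded generator assumed for reproducibility; the names diff, rows, cols and unique are made up for illustration), the 10x10x2 shape belongs to the intermediate difference array, while the result after the sum is 10x10. That 10x10 matrix is symmetric, so the 10*9/2 = 45 distinct pairs can be read from its upper triangle (55 entries if the ten zero self-distances on the diagonal are counted as well).

```python
import numpy as np

# Assumption: seeded generator, used only for reproducibility.
X = np.random.default_rng(0).random((10, 2))

diff = X[:, np.newaxis, :] - X[np.newaxis, :, :]
dist_sq = np.sum(diff ** 2, axis=-1)

print(diff.shape, diff.size)         # (10, 10, 2) 200
print(dist_sq.shape, dist_sq.size)   # (10, 10) 100

# dist_sq[i, j] and dist_sq[j, i] describe the same pair of points.
print(np.allclose(dist_sq, dist_sq.T))   # True

# Indices strictly above the diagonal: each unordered pair exactly once.
rows, cols = np.triu_indices(10, k=1)
unique = dist_sq[rows, cols]
print(unique.shape)                  # (45,)
```

The full matrix stores every distance twice plus the diagonal, which is redundant but convenient: row i lists the distances from point i to every point, which is the layout a nearest-neighbor lookup needs.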

