Matlab: Euclidean norm (or difference) between two vectors

Question

I'd like to calculate the Euclidean distance between a vector G and each row of an array C, while dividing each row by a value in a vector GSD. What I've done seems very inefficient. What's my biggest overhead? Could I speed it up?

m=1E7;
G=1E5*rand(1,8);
C=1E5*[zeros(m,1),rand(m,8)]; 
GSD=10*rand(1,8);

%I've taken the log10 of the values because G and C are very large in magnitude. 
%Don't know if it's worth it.

for i=1:m
    dG(i,1)=norm((log10(G)-log10(C(i,2:end)))/log10(GSD));
end

Using the examples from below, they don't all give the same answer. In fact none of them give the same answer (see following figure using:

dG = pdist2(log10(G),log10(C(:,2:end)),'mahalanobis',diag(log10(GSD))); %(1)

dG = sqrt(sum((log10(G)-log10(C(:,2:end))./log10(GSD)).^2,2)); 

tmp=bsxfun(@rdivide,bsxfun(@minus,log10(G),log10(C(:,2:end))),log10(GSD)); %(4)
dG = sqrt(sum(tmp.^2,2));

Nicky Mattsson · Accepted Answer

You can use pdist2(x,y) to calculate the pairwise distance between all elements in x and y, thus your example would be something like

dG = pdist2(log10(G),log10(C(:,2:end)),'mahalanobis',diag(log10(GSD)).^2);

where the name-pair 'mahalanobis',diag(log10(GSD)).^2 puts log10(GSD) as weights on the Eucledean, which is the known as the Mahalanobis distance.

Note that the Mahalanobis distance is originally intented for normalising data, thus it is the "covariance" which have to be put as the fourth input, which MATLAB then finds the Cholesky decomposition of (element-wise squareroot when diagonal, as here).

Implicit expansion

In newer MATLAB editions, one can also just just the implcit expansion as the first entry is only 1 vector.

dG = sqrt(sum(((log10(G)-log10(C(:,2:9)))./log10(GSD)).^2,2));

which is probably a tad faster, I do, however, prefer the pdist2 solution as I find it clearer.

Matlab: Euclidean norm (or difference) between two vectors

Answers (2)

Related Questions