Vectorising (or speeding up) a double loop with summation over non-identical indices in R

Question

I am trying to optimise code that computes double sums of products of the elements of two square matrices. Let’s say we have two square matrices of size n, W and V. The object that needs to be computed is a vector B with elements

$B_i=\sum_{\substack{j=1\\ j\ne i}}^n\sum_{\substack{k=1\\ k\ne j\\ k\ne i\\ j\ne i}}^n W_{ik}V_{jk}$

In simple terms: compute element-by-element products of two different rows in two different matrices and take their sum, then take an extra sum over all rows of the second matrix (sans identical indices).

The problem is that the computational complexity of this task is seemingly O(n³): the object we are creating, B, has length n, and each element requires two summations. This is what I have come up with:

For given i and j (i≠j), start with the inner sum over k. Sum for all k, then subtract the terms for k=i and k=j, and multiply by the indicator of j≠i.
Since the restriction j≠i has been taken care of in the inner sum, the outer sum is taken just for j=1,...,n.

If we denote $A_{ijk}=W_{ik}V_{jk}$, then the two steps will look like $B_{ij}=\left(\sum_{k=1}^n A_{ijk}-A_{iji}-A_{ijj}\right)\mathbb{I}(i\ne j)$ and $B_{i}=\sum_{j=1}^nB_{ij}$. Note that the two excluded terms are subtracted once, outside the sum over k.
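The two-step decomposition above can be sanity-checked against the literal definition of $B_i$ on a small example. The following is only an illustrative sketch: the matrices `W` and `V`, the size `n = 6`, and the names `B_direct` and `B_two_step` are assumptions made for the check and are not part of the original code.

```r
set.seed(42)
n <- 6                                  # small illustrative size (assumption)
W <- matrix(runif(n * n), n, n)         # arbitrary test matrices (assumption)
V <- matrix(runif(n * n), n, n)

# Literal definition: sum over j != i and over k distinct from both i and j
B_direct <- numeric(n)
for (i in 1:n) for (j in 1:n) for (k in 1:n) {
  if (j != i && k != i && k != j) {
    B_direct[i] <- B_direct[i] + W[i, k] * V[j, k]
  }
}

# Two-step version: full sum over k, minus the k = i and k = j terms,
# kept only when i != j
B_two_step <- numeric(n)
for (i in 1:n) for (j in 1:n) {
  if (i != j) {
    B_two_step[i] <- B_two_step[i] +
      sum(W[i, ] * V[j, ]) - W[i, i] * V[j, i] - W[i, j] * V[j, j]
  }
}

all.equal(B_direct, B_two_step)
```

Because i ≠ j, the terms for k = i and k = j are distinct, so subtracting each once removes exactly the excluded indices.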

However, writing a loop turned out to be very inefficient. n=100 works quickly (0.05 seconds). But, for instance, when n=500 (we are talking about real-world applications here), the average computation time is 3 seconds, and for n=1000, it jumps to 22 s.

The inner loop over k can be easily replaced by a sum, but the outer one... In this question, the suggested solution is sapply, but it implies that the summation must be done over all elements.

This is the code I am trying to evaluate before the heat death of the Universe for large n.

set.seed(1)
N <- 500
x1 <- rnorm(N)
x2 <- rchisq(N, df=3)

bw1 <- bw.nrd(x1)
bw2 <- bw.nrd(x2)
w <- outer(x1, x1, function(x, y) dnorm((x-y)/bw1) )
w <- w/rowSums(w)

v <- outer(x2, x2, function(x, y) dnorm((x-y)/bw2) )
v <- v/rowSums(v)

Bij <- matrix(NA, ncol=N, nrow=N)
for (i in 1:N) { # Around 22 secs for N=1000
  for (j in 1:N) {
    Bij[i, j] <- (sum(w[i, ]*v[j, ]) - w[i, i]*v[j, i] - w[i, j]*v[j, j]) * (i!=j)
  }
}
Bi <- rowSums(Bij)

How would an expert R programmer vectorise this kind of loop?

RolandASc · Accepted Answer

Without looking into the content of your matrices w and v, your double for-loop can be replaced with simple matrix operations, using one matrix multiplication (tcrossprod), transpose (t) and diagonal extraction:

Mat.ij <- tcrossprod(w, v) - 
    matrix(rep(diag(w), times = N), nrow = N) * t(v) - 
    w * matrix(rep(diag(v), each = N), nrow = N)
diag(Mat.ij) <- 0

all.equal(Bij, Mat.ij)
[1] TRUE
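If only the vector of row sums is needed, the sums over j can in principle be pushed inside the expression, so that the N × N product `tcrossprod(w, v)` is never formed. The sketch below is an algebraic rearrangement of the same formula and not part of the answer above; the small random `w` and `v` and the names `cs`, `dv`, `dw` and `Bi_fast` are assumptions made for illustration.

```r
set.seed(1)
N <- 50                                  # illustrative size (assumption)
w <- matrix(runif(N * N), N, N); w <- w / rowSums(w)
v <- matrix(runif(N * N), N, N); v <- v / rowSums(v)

# Reference: the matrix expression from the answer, then row sums
Mat.ij <- tcrossprod(w, v) -
    matrix(rep(diag(w), times = N), nrow = N) * t(v) -
    w * matrix(rep(diag(v), each = N), nrow = N)
diag(Mat.ij) <- 0
Bi_ref <- rowSums(Mat.ij)

# Rearranged version using only matrix-vector products and elementwise work
cs <- colSums(v)                         # cs[k] = sum over j of v[j, k]
dv <- diag(v)
dw <- diag(w)
Bi_fast <- drop(w %*% (cs - dv)) -       # sum_k w[i,k] * (cs[k] - v[k,k])
  rowSums(w * v) -                       # removes the j = i term of the full sum
  dw * cs +                              # removes w[i,i] * sum_j v[j,i]
  2 * dw * dv                            # restores w[i,i] * v[i,i], subtracted twice

all.equal(Bi_ref, Bi_fast)
```

This variant costs O(n²) operations instead of the O(n³) matrix product, at the price of no longer having the intermediate matrix `Bij` available. With the `w` and `v` from the question, `Bi_fast` should agree with `Bi` up to floating-point tolerance.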
