How to ignore zeros in matrix multiplications?

Question

Let's say I have a 10000 x 10000 matrix W with random numbers, and two 10000 dim vectors U and V, U has random numbers in it, V is filled with zeros. With numpy or pytorch, computing U @ W and V @ W takes the same amount of time. My question is, is there a way to optimize matrix multiplication so that it skips or ignores zeros during calculation, so things like V @ W will be computed faster?

import numpy as np
W = np.random.rand(10000, 10000)

U = np.random.rand(10000)
V = np.zeros(10000)

y1 = U @ W
y2 = V @ W
# computing y2 should take less amount of time than y1 since it always returns zero vector.

Grr · Accepted Answer

You can use scipy.sparse classes to improve your performance, but it entirely depends on the matrix. For example the performance gained from using V as a sparse matrix will be great. That gained by converting U to a sparse matrix will not be great or may in fact decrease performance (in the case U is in fact dense).

import numpy as np
import scipy.sparse as sps

W = np.random.rand(10000, 10000)
U = np.random.rand(10000)
V = np.zeros(10000)

%timeit U @ W
125 ms ± 1.45 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
%timeit V @ W
128 ms ± 6.4 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

Vsp = sps.csr_matrix(V)
Usp = sps.csr_matrix(U)
Wsp = sps.csr_matrix(W)

%timeit Vsp.dot(Wsp)
1.34 ms ± 15.6 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each) 
%timeit Vsp @ Wsp
1.39 ms ± 37.2 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
%timeit Usp @ Wsp
2.37 s ± 84.3 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

As you can see there is a major improvement from using sparse methods for V @ W, but you in fact decrease performance for U @ W as none of the entries in U or W are zero.

How to ignore zeros in matrix multiplications?

Answers (2)

Related Questions