Computation difference between function and manual computation

Question

I am facing a mystery right now. I get strange results in a program, and I think the cause is the computation itself: my function returns different results from the same computation done manually.

This is from my program, where I print the values before the computation:

print("\nPrecomputation:\nmatrix\n:", matrix)
tmp = likelihood_left * likelihood_right
print("\nconditional_dep:", tmp)
print("\nfinal result:", matrix @ tmp)

I got the following output:

Precomputation:
matrix: 
[array([0.08078721, 0.5802404 , 0.16957052, 0.09629893, 0.07310294])
 array([0.14633129, 0.45458744, 0.20096238, 0.02142105, 0.17669784])
 array([0.41198731, 0.06197812, 0.05934063, 0.23325626, 0.23343768])
 array([0.15686545, 0.29516415, 0.20095091, 0.14720275, 0.19981674])
 array([0.15965914, 0.18383683, 0.10606946, 0.14234812, 0.40808645])]

conditional_dep: [0.01391123 0.01388155 0.17221067 0.02675524 0.01033257]
final result: [0.07995043 0.03485223 0.02184015 0.04721548 0.05323298]

The thing is, when I run the following code:

import numpy as np

matrix = [np.array([0.08078721, 0.5802404 , 0.16957052, 0.09629893, 0.07310294]),
          np.array([0.14633129, 0.45458744, 0.20096238, 0.02142105, 0.17669784]),
          np.array([0.41198731, 0.06197812, 0.05934063, 0.23325626, 0.23343768]),
          np.array([0.15686545, 0.29516415, 0.20095091, 0.14720275, 0.19981674]),
          np.array([0.15965914, 0.18383683, 0.10606946, 0.14234812, 0.40808645])]

tmp = np.asarray([0.01391123, 0.01388155, 0.17221067, 0.02675524, 0.01033257])

matrix @ tmp

The values are exactly the same as in the earlier computation, but I get the following result:

array([0.04171218, 0.04535276, 0.02546353, 0.04688848, 0.03106443])

This result obviously differs from the previous one, and it is the correct one (I computed the dot product by hand).

I have been stuck on this problem the whole day and have not found anything useful online. If any of you has even a tiny idea of where it could come from, I'd be really happy :D

Thanks in advance, Yann

PS: I can show more of the code if needed.

PS2: I don't know if it is relevant, but this is used in a dynamic programming algorithm.

Seb · Accepted Answer

To recap our discussion in the comments, in the first part ("pre-computation"), the following is true about the matrix object:

>>> matrix.shape
(5,)
>>> matrix.dtype
dtype('O') # aka object

And as you say, this is due to matrix being a slice of a larger, non-uniform array. Let's recreate this situation:

>>> matrix = np.array([
...     [],
...     np.array([0.08078721, 0.5802404 , 0.16957052, 0.09629893, 0.07310294]),
...     np.array([0.14633129, 0.45458744, 0.20096238, 0.02142105, 0.17669784]),
...     np.array([0.41198731, 0.06197812, 0.05934063, 0.23325626, 0.23343768]),
...     np.array([0.15686545, 0.29516415, 0.20095091, 0.14720275, 0.19981674]),
...     np.array([0.15965914, 0.18383683, 0.10606946, 0.14234812, 0.40808645]),
... ], dtype=object)[1:]  # dtype=object is required for ragged input on recent NumPy versions

It is no longer a matrix of scalars arranged in rows and columns, but a 1-D array whose five elements are themselves 1-D arrays. Technically, matrix @ tmp is then an operation between two 1-D arrays, so NumPy computes their inner product, as the documentation says. That is what happens here: each element matrix[i] is a whole row, it gets scaled by the scalar tmp[i], and the scaled rows are summed, so the sum runs over the first axis:

>>> np.array([matrix[i] * tmp[i] for i in range(5)]).sum(axis=0)
array([0.07995043, 0.03485222, 0.02184015, 0.04721548, 0.05323298])
>>> matrix @ tmp
array([0.07995043, 0.03485222, 0.02184015, 0.04721548, 0.05323298])

This is essentially the same as taking the transpose of the proper 2-D matrix before the multiplication:

>>> np.stack(matrix).T @ tmp
array([0.07995043, 0.03485222, 0.02184015, 0.04721548, 0.05323298])

Equivalently, as noted by @jirasssimok:

>>> tmp @ np.stack(matrix)
array([0.07995043, 0.03485222, 0.02184015, 0.04721548, 0.05323298])

Hence the erroneous or unexpected result.
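
The effect is easier to see with numbers you can check by eye. Below is a minimal sketch with made-up toy data (the names rows, v, obj and proper are not from your program), assuming a NumPy version whose matmul accepts object arrays, as it evidently does in your case:

```python
import numpy as np

# Hypothetical toy data, chosen so the products can be checked by hand.
rows = [np.array([1.0, 2.0]), np.array([3.0, 4.0])]
v = np.array([10.0, 100.0])

# 1-D object array whose elements are the row arrays: shape (2,), dtype object.
obj = np.empty(len(rows), dtype=object)
for i, row in enumerate(rows):
    obj[i] = row

# Proper 2-D float matrix built from the same rows: shape (2, 2).
proper = np.stack(rows)

weighted_rows = obj @ v   # 10 * [1, 2] + 100 * [3, 4]
row_dots = proper @ v     # [1*10 + 2*100, 3*10 + 4*100]

print(weighted_rows)  # [310. 420.]
print(row_dots)       # [210. 430.]
```

The object array yields a weighted sum of the rows, which equals proper.T @ v, while the 2-D array yields one dot product per row.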

As you have already resolved to do in the comments, this can be avoided in the future by ensuring all matrices are proper 2-D arrays.
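
One way to enforce that could look like the sketch below. The helper as_2d_float is a hypothetical name of mine, not a NumPy function, and the ragged array is only a toy stand-in for the larger, non-uniform array in your program:

```python
import numpy as np

def as_2d_float(rows):
    """Hypothetical helper: stack equal-length 1-D rows into a float64 matrix."""
    # np.stack raises ValueError by itself if the rows differ in length.
    matrix = np.stack([np.asarray(row, dtype=np.float64) for row in rows])
    if matrix.ndim != 2:
        raise ValueError(f"expected a 2-D matrix, got shape {matrix.shape}")
    return matrix

# Toy stand-in for a slice of a larger, non-uniform object array.
ragged = np.empty(3, dtype=object)
ragged[0] = np.array([])               # the odd-sized entry
ragged[1] = np.array([1.0, 2.0, 3.0])
ragged[2] = np.array([4.0, 5.0, 6.0])
sliced = ragged[1:]                    # shape (2,), dtype object

matrix = as_2d_float(sliced)           # shape (2, 3), dtype float64
tmp = np.array([1.0, 10.0, 100.0])
result = matrix @ tmp                  # one dot product per row

print(matrix.shape, matrix.dtype)  # (2, 3) float64
print(result)                      # [321. 654.]
```

Converting once, right after slicing, means every later @ sees an ordinary 2-D array, and a wrong shape fails loudly instead of silently producing a different contraction.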
