Matrix representation in C

Question

I want to find out what is the best representation of a m x n real matrix in C programming language.

What are advantages of matrix representation as a single pointer:

double* A;

With this representation you could allocate memory:

A = (double* )malloc(m * n * sizeof(double));

In such representation matrix access requires an extra multiplication:

aij = A[i * m + j];

What are disadvantages of matrix representation as a double pointer:

double** B;

Memory allocation requires a loop:

double** B = (double **) malloc(m * sizeof(double*));
for (i = 0; i < m; i++)
    A[i] = (double *) malloc(n * sizeof(double))

In such representation you could use intuitive double indexing `bij = B[i][j], but is there some drawback that would affect performance. I would want to know what is the best presentation in terms of performance.

These matrices should be used in numerical algorithms such as singular value decomposition. I need to define a function:

void svd(Matrix A, Matrix U, Matrix Sigma, Matrix V);

and I am looking for the best way to represent Matrix. If there is any other efficient way to represent a matrix in C, please, let me know.

I have seen that most people use single pointer representation. I would like to know if there are some performance benefits as opposed to double array representation?

Jonathan Leffler · Accepted Answer

Look at the memory accesses required.

For the single-pointer case, you have:

read a pointer (the base address), probably from a register
read the four integers, probably from registers or hard-coded into instruction set. For array[i*m+j], the 4 values are i, m, j and sizeof(array[0]).
multiply and add
access the memory address

For the double-pointer case, you have:

read a pointer (the base address), probably from a register
read an index, probably from a register
multiply the index by the size of a pointer and add.
fetch the base address from memory (unlikely to be a register, might be in cache with luck).
read another index, probably from a register
multiply by the size of the object and add
access the memory address

The fact that you have to access two memory locations probably makes the double-pointer solution quite a bit slower than the single-pointer solution. Clearly, caching will be critical; that's one reason why it is important to access arrays so that the accesses are cache-friendly (so you access adjacent memory locations as often as possible).

You can nit-pick about details in my outline, and some 'multiplication' operations may be shift operations, etc, but the general concept remains: the double-pointer requires two memory accesses versus one for the single-pointer solution, and that will be slower.

Matrix representation in C

Answers (2)

Related Questions