Loop unrolling for multiplying two matrices NxN?

Question

I'm trying to figure out a good Loop unrolling for multiplying two matrices .

For example if we wanted to Sum a NxN matrix :

void SumMatrix(int *M, int n, int *result) 
{ 
  int  i,j; 

  *result = 0; 
  for (i=0; i



We can do this : 

void SumMatrix(int *M, int n, int *result) 
{ 
    int  i; 
    int  size = n*n; 
    int  last = size%8; 
    int  acc1 = 0; 
    int  acc2 = 0; 
    int  *pEnd = M+size-last; 

    for (; M


But I've tried to find a (GOOD) way to multiply 2 matrices , however found none at the moment . 

Remark : this is no homework task , I have an exam today and just thought about this question , I think it could be a fine question for an exam , don't you  ?

I'd appreciate any help 

Regards

Ron

Vanwaril · Accepted Answer

Most compilers will do the unrolling for you (you might need to turn on a flag, or set it to an optimization level - I believe -funroll-loops does it for gcc).

Also, with your question, the fact that it is a 2D matrix doesn't matter, since you are adding all the numbers up. If you are limited to a single process/thread, adding the numbers up sequentially will be the fastest because that has optimal caching performance. You might get some benefit out of SSE or vector instructions; again, today's compilers can do these for you with such a simple problem.

Loop unrolling for multiplying two matrices NxN?

Answers (2)

Related Questions