How do you use cblas_dgemm to do a vector outer product?

Question

I am trying to do a Column Vector multiplication with a Row Vector. Can I use dgemm?

In other words D = A * B where D is a Matrix, A is a Column Vector and B is a Row Vector.

I followed the documentation found here https://software.intel.com/en-us/node/520775. I cannot seem to get the parameters right for cblas_dgemm

Here is my try. In my case m = nRows, n = nCols, k = 1

The problem seems to be lda, ldb and ldc. I have defined them respectively as nCols, k, nRows.

#include 
#include 
#include 
#include 

#include 
#include 

#define nCols 5
#define nRows 20
#define k 1

void PrintMatrix(double* pMatrix, const size_t nR, const size_t nC, const CBLAS_ORDER Order) {
    unsigned int i, j;
    if (Order == CblasRowMajor)
    {
        for (i = 0; i < nR; i++)
        {
            for (j = 0; j < nC; j++)
            {
                printf("%f 	 ", pMatrix[i * nC + j]); // !!!
            }
            printf("
"); // !!!
        }

    }

    else

    {
        for (i = 0; i < nR; i++) {
            for (j = 0; j < nC; j++) {
                printf("%f 	 ", pMatrix[i + j* nR ]); // !!!
            }
            printf("
"); // !!!
        }

    }
    printf("
"); // !!!

}

int main(void) {

    double A[] = { 8, 4, 7, 3, 5, 1, 1, 3, 2, 1, 2, 3, 2, 0, 1, 1 , 2, 3, 4, 1};

    double B[] = { -1, 2, -1, 1, 2 };


    double alpha = 1.0, beta = 0.0;
    int i, lda, ldb, ldc;
    double *C, *D;
    D = (double*) malloc(nRows * nCols * sizeof(double));
    C = (double*) malloc(nRows * nCols * sizeof(double));

    for (i = 0; i < nRows*nCols; i++)
        D[i] = 0.0;
    for (i = 0; i < nRows*nCols; i++)
        C[i] = 0.0;

    lda = nCols;
    ldb = k;
    ldc = nRows;

    cblas_dger(CblasRowMajor, nRows, nCols, alpha, A, 1, B, 1, C, nCols);

    PrintMatrix(C, nRows, nCols,CblasRowMajor);
    cblas_dgemm (CblasRowMajor, CblasNoTrans, CblasNoTrans, nRows,  nCols, k, alpha, A, lda, B, ldb, beta, D, ldc);

    PrintMatrix(D, nRows, nCols, CblasRowMajor);

    free(D);
    free(C);

    return 0;
}

ztik · Accepted Answer

Short answer is, yes you can use dgemm for rank-1 update. The dger is of course suggested, since it is expected to be better optimized for this operation.

As far as the use of cblas_dgemm . As you know the definition of leading dimension is:

lda: The size of the first dimension of matrix A

The operation you are trying to perform is: D(20x5) = A(20x1) * B(1x5)

You are using CblasRowMajor so leading dimension the number of columns for all matrices (for explanation see https://stackoverflow.com/a/30208420/2707697). Meaning:

lda = 1;
ldb = 5;
ldc = 5;

How do you use cblas_dgemm to do a vector outer product?

Answers (2)

Related Questions