Performance issue in Finite difference method

Question

I wrote a piece of C code that uses the finite difference method to estimate values. It is an averaging method: each point is replaced by the mean of its neighbours. I profiled the code and found that the iterate() function is the slowest part.

void iterate(double data[][ARRAY_SIZE], int nx, int ny, int dx, int dy)
{
    for (int i = 0; i < nx; ++i)
    {
        for (int j = 0; j < ny; ++j)
        {
            if (i % (dx + 1) == 0 && j % (dy + 1) == 0)
                continue;
            else if (i == 0 && 0 < j && j < ny)
                data[i][j] = (data[i][j - 1] + data[i][j + 1] + data[i + 1][j]) / 3;
            else if (j == 0 && 0 < i && i < nx)
                data[i][j] = (data[i - 1][j] + data[i + 1][j] + data[i][j + 1]) / 3;
            else if (i == nx - 1 && 0 < j && j < ny)
                data[i][j] = (data[i][j - 1] + data[i][j + 1] + data[i - 1][j]) / 3;
            else if (j == ny - 1 && 0 < i && i < nx)
                data[i][j] = (data[i - 1][j] + data[i + 1][j] + data[i][j - 1]) / 3;
            else
                data[i][j] = (data[i - 1][j] + data[i + 1][j] + data[i][j - 1] + data[i][j + 1]) / 4;
        }
    }
}

This loop runs slowly, and I am not sure what I am missing that makes it slow. Is there a better way to do the same thing?

2000 iterations on a 400x400 double array take:

real    0m1.950s
user    0m1.940s
sys     0m0.004s

John Zwinck · Accepted Answer

Here are some ideas:

It appears that ny must equal ARRAY_SIZE, so you may as well drop it as a parameter and use the compile-time constant.
Each of the else-if clauses before the final else applies only to one specific edge row or column, so hoist them out of the inner loop. For example, you can process the first row and first column as 1D loops, then the interior of the matrix, and finally the rightmost column and the bottom row.

In the end, your core loop should look more like this (plus whatever check you still need to skip the fixed points):

for (int i = 1; i < nx - 1; ++i)
{
    for (int j = 1; j < ARRAY_SIZE - 1; ++j)
    {
        data[i][j] = (data[i - 1][j] + data[i + 1][j] + data[i][j - 1] + data[i][j + 1]) / 4;
    }
}
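
To make the hoisting concrete, here is a rough sketch of what the whole function could look like. It is not the asker's code and makes a few assumptions: ARRAY_SIZE is 400 (taken from the 400x400 case), the name iterate_hoisted is made up for illustration, and the four corners are left untouched because the original branches read one element past the array there. The edge elements of each row are handled inside the row loop rather than in separate passes, so the in-place update order stays the same as in the original. The fixed-point test from the first if is kept, but its row part is evaluated once per row instead of once per element.

```c
#include <assert.h>

#define ARRAY_SIZE 400 /* assumption: column count, from the 400x400 case */

/* Hypothetical variant of iterate(): same averaging, edge cases hoisted.
 * Assumption: the four corners are never updated. */
void iterate_hoisted(double data[][ARRAY_SIZE], int nx, int dx, int dy)
{
    const int ny = ARRAY_SIZE;
    const int sx = dx + 1, sy = dy + 1; /* spacing of the fixed points */
    assert(nx >= 2 && sx > 0 && sy > 0);

    /* Top row (i = 0), corners excluded. Row 0 always holds fixed points,
     * so only the column test is needed. */
    for (int j = 1; j < ny - 1; ++j)
        if (j % sy != 0)
            data[0][j] = (data[0][j - 1] + data[0][j + 1] + data[1][j]) / 3;

    for (int i = 1; i < nx - 1; ++i)
    {
        const int fixed_row = (i % sx == 0); /* evaluated once per row */

        /* Left edge (j = 0): fixed exactly when the row holds fixed points. */
        if (!fixed_row)
            data[i][0] = (data[i - 1][0] + data[i + 1][0] + data[i][1]) / 3;

        /* Interior: the only branch left is the fixed-point skip. */
        for (int j = 1; j < ny - 1; ++j)
        {
            if (fixed_row && j % sy == 0)
                continue;
            data[i][j] = (data[i - 1][j] + data[i + 1][j]
                        + data[i][j - 1] + data[i][j + 1]) / 4;
        }

        /* Right edge (j = ny - 1). */
        if (!(fixed_row && (ny - 1) % sy == 0))
            data[i][ny - 1] = (data[i - 1][ny - 1] + data[i + 1][ny - 1]
                             + data[i][ny - 2]) / 3;
    }

    /* Bottom row (i = nx - 1), corners excluded. */
    const int last = nx - 1;
    const int fixed_last = (last % sx == 0);
    for (int j = 1; j < ny - 1; ++j)
        if (!(fixed_last && j % sy == 0))
            data[last][j] = (data[last][j - 1] + data[last][j + 1]
                           + data[last - 1][j]) / 3;
}
```

You would call it as iterate_hoisted(data, nx, dx, dy) in place of iterate(data, nx, ny, dx, dy). Whether this is actually faster on your machine is something to measure with the same timing run as before.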
