Reputation: 2942
I need to run a simulation over a weekend. I would like the simulation to be large enough to be as descriptive as possible, but I don't want it to still be running when I get back to work, because then I can't continue working on this software.
The algorithm is such that once started, it must complete or there is no point in running it at all.
The individual elements of the simulation run in factorial, n^4, and n^2 time. My measured run times by n are:
<= 6 = 0ms
7 = 8ms
8 = 91ms
9 = 1,089ms
10 = 14,666ms
11 = 197,288ms
12 = 3,091,739ms
I have fit a curve to these samples in WolframAlpha here. Two things concern me. First, the n^4 component comes out negative, which doesn't make any sense, since it is certainly a contributing factor to the run time. Second, I have tried to estimate run times in similar situations in the past, and my extrapolations are usually way off.
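That is, the model being fit has the form

T(n) = a*n! + b*n^4 + c*n^2

with a, b and c left as free coefficients.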
Do any of you have experience with guessing the run-time of an algorithm based on the input size in this way?
Upvotes: 4
Views: 896
Reputation: 10242
In general, when you have an O(N^4) term you also have to introduce terms for O(N^3), O(N^2), O(N) and O(1). In other words, try adding x^3, x^1 and x^0 into the curve-fitting model.
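In other words, fit something of the shape below (this is just the general form, not WolframAlpha syntax; the coefficients a through f are whatever the fit produces):

T(n) = a*n! + b*n^4 + c*n^3 + d*n^2 + e*n + f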
For this particular case, where you have an O(N!), well, I would follow amit's advice and consider only the factorial part, as it seems to converge quite fast.
But in any case, if you really have an O(N!) algorithm you don't need to estimate at all: just use an iterative deepening approach. Let your computer run the case for n = 1, 2, 3, 4, 5, 6, 7, ... and let it go as far as it can.
It may seem that you are wasting computer time, but if you analyze it you will see that the wasted time is insignificant. For instance, you are already at n = 12, so for n = 13 the required CPU time C13 would be 13*C12, C12 = 12*C11 and so on. Plugging in your measurements, sum(C0..C13)/C13 = 1.082, so running your function for all the values from 0 to 13 will be only about 8% more expensive than running it just for 13. And as you go to bigger values of N, this overhead shrinks even further.
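A minimal sketch of that loop (run_simulation here is just a stand-in name for whatever function performs the real O(N!) work, it is not from your code):

#include <stdio.h>

/* Stand-in for the real O(N!) simulation step; replace with the actual call. */
static void run_simulation(int n) {
    (void)n;  /* the expensive work for input size n would happen here */
}

int main(void) {
    /* Iterative deepening: run the complete simulation for n = 1, 2, 3, ...
       Every finished level is a usable result, so whatever has completed
       when the process is stopped is what you keep. */
    for (int n = 1; ; n++) {
        run_simulation(n);
        printf("completed n = %d\n", n);  /* checkpoint/persist results here */
        fflush(stdout);
    }
    return 0;
}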
update:
Why you need to add terms for all the powers below the complexity level:
Consider a simple three-level loop with complexity O(N^3):
void foo(int n) {
    int i, j, k;
    for (i = 0; i < n; i++)
        for (j = 0; j < n; j++)
            for (k = 0; k < n; k++)
                do_something(i, j, k);
}

foo(3);
It is obvious that do_something(i, j, k) is called n^3 times.
But if we start from the beginning and consider every instruction executed, we can see that entering and leaving the function, setting up the stack and other low-level tasks are done once; the i = 0 instruction is also performed once. These are the instructions corresponding to the n^0 cost.
The instructions i < n, i++ and j = 0 are executed n times; they correspond to the n^1 term.
The instructions j < n, j++ and k = 0 are executed n^2 times; they correspond to the n^2 term.
Well, and so on.
More complex cases are the same: you always have instructions running a number of times proportional to all the powers below that of the complexity level.
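To see those counts concretely, here is a small instrumented version of foo (foo_counted and the counter names are mine, added just for this illustration):

#include <stdio.h>

/* Counts of how many times each "level" of foo() executes its body. */
static long c0, c1, c2, c3;

void foo_counted(int n) {
    int i, j, k;
    c0++;                       /* function entry, i = 0: the n^0 term */
    for (i = 0; i < n; i++) {
        c1++;                   /* i < n, i++, j = 0: the n^1 term */
        for (j = 0; j < n; j++) {
            c2++;               /* j < n, j++, k = 0: the n^2 term */
            for (k = 0; k < n; k++) {
                c3++;           /* do_something(i, j, k): the n^3 term */
            }
        }
    }
}

int main(void) {
    foo_counted(10);
    printf("n^0: %ld  n^1: %ld  n^2: %ld  n^3: %ld\n", c0, c1, c2, c3);
    /* prints: n^0: 1  n^1: 10  n^2: 100  n^3: 1000 */
    return 0;
}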
Regarding your measurement of C(0) = 0, it is just a problem of your timings not being accurate enough. The value could be very small, but never an absolute 0.
And finally, if your curve fitting doesn't work, it is because of the N! part: in that case you also have instructions running (n-1)! times, (n-2)! times and so on.
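That is, a model that really matches an O(N!) algorithm looks more like the sketch below, with one coefficient per factorial term on top of the polynomial ones (the a and b names are just for illustration):

T(n) = a0*n! + a1*(n-1)! + a2*(n-2)! + ... + b4*n^4 + b3*n^3 + b2*n^2 + b1*n + b0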
Upvotes: 1
Reputation: 178491
Why extrapolation fails:
Extrapolations are expected to be way off when you are looking to estimate a value outside of the original range.
In polynomial extrapolation, at least, the error is a function of the product of the distances between the point and all the sample points, and of the maximum of the n-th derivative of the function over the range. If these distances are large, that product (distances times the max derivative) is expected to be large.
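For reference, the standard error term for a degree-n polynomial interpolant p through sample points x_0, ..., x_n is, for some point c lying between x and the samples:

f(x) - p(x) = f^(n+1)(c) / (n+1)! * (x - x_0)(x - x_1)...(x - x_n)

Far outside the sampled range, every (x - x_i) factor is large, so the product blows up.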
The n^4 component is given a negative value because the fit is simply the function that best explains the "observations"; nothing constrains the coefficients to be physically meaningful.
For estimating run time outside of the sample zone, I'd recommend avoiding extrapolation. By definition it is not a good estimate when you go far outside the "comfort zone" of the known samples.
Thinking of an alternative:
I'd try to find a rough estimate of the constants (by statically analyzing the code). Mainly, I would like to see whether the factorial component has a very small constant and the rest have very large constants. If that is not the case, the n^2 and n^4 components can be ignored; they are negligible compared to the factorial component, which makes the function much easier to analyze.
P.S. From looking at the dynamic data you provided, that seems to be the case: the ratio between successive run times converges quickly to the factorial factor, so analyzing the function as a factorial and estimating f(12) ~= 12 * f(11) and so on seems like a reasonable assumption.
If you want to be on the safe side, you can use f(n) = (n + d) * f(n-1), where d is a predefined positive constant, for example d = max{0, f(11)/f(10) - 11}.
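A quick sketch of that estimate, using the timings from the question (the array indexing and variable names are mine, just for illustration):

#include <stdio.h>

int main(void) {
    /* Measured run times in ms, indexed by n (values taken from the question). */
    double f[32] = {0};
    f[7]  = 8;      f[8]  = 91;      f[9]  = 1089;
    f[10] = 14666;  f[11] = 197288;  f[12] = 3091739;

    /* Safety margin d = max{0, f(11)/f(10) - 11}. */
    double d = f[11] / f[10] - 11.0;
    if (d < 0.0) d = 0.0;

    /* Pessimistic extrapolation: f(n) = (n + d) * f(n-1). */
    for (int n = 13; n <= 16; n++) {
        f[n] = (n + d) * f[n - 1];
        printf("estimated f(%d) ~= %.0f ms (about %.1f hours)\n",
               n, f[n], f[n] / 3600000.0);
    }
    return 0;
}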
P.S.2: Since the behavior of the factorial is so radical, you can try to run the simulation iteratively: f(1) + f(2) + ... + f(n) is not expected to take much longer than f(n) alone for any n > 10. By doing so, you will keep whatever result you had time to calculate, even if you have to abort it when you come back to the office. An algorithm with this property is called an anytime algorithm.
Upvotes: 1