Scala foldLeft performance 4x worse than for loop when working with floats

Question

Can someone with broader knowledge of the subject explain this? Why is

var acc = 0.0
for (i <- 0 until 100)
    acc += 4.0 * (1 - (i % 2) * 2) / (2 * i + 1)

over 4 times faster than

(0 until 100).foldLeft(0.0)({
    (d, i) => d + 4.0 * (1 - (i % 2) * 2) / (2 * i + 1)
})

I prefer the functional version; I just don't quite understand the performance hit. Is there any way to optimize this, or maybe an alternative to foldLeft?

dhg · Accepted Answer

First, this is far too small an example to get an accurate benchmark. You can't really tell anything from these numbers. You also have to control for things like JVM warm-up, which will totally throw off any numbers you get.
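To make the warm-up point concrete, here is a minimal timing sketch, not a rigorous benchmark. The names FoldBench, term, withFor, withFold and avgNanos are made up for illustration, and the warm-up and run counts are arbitrary assumptions. It runs each version untimed first so the JIT has a chance to compile it, then averages over many timed calls.

```scala
object FoldBench {
  // One term of the series from the question, pulled into a helper (illustrative name).
  def term(i: Int): Double = 4.0 * (1 - (i % 2) * 2) / (2 * i + 1)

  // The for-loop version from the question.
  def withFor(n: Int): Double = {
    var acc = 0.0
    for (i <- 0 until n) acc += term(i)
    acc
  }

  // The foldLeft version from the question.
  def withFold(n: Int): Double =
    (0 until n).foldLeft(0.0)((d, i) => d + term(i))

  // Calls `body` `warmup` times untimed, then returns the average
  // nanoseconds per call over `runs` timed calls.
  def avgNanos(warmup: Int, runs: Int)(body: => Double): Double = {
    var sink = 0.0 // accumulate results so the work cannot be optimized away
    var k = 0
    while (k < warmup) { sink += body; k += 1 }
    val start = System.nanoTime()
    k = 0
    while (k < runs) { sink += body; k += 1 }
    val elapsed = System.nanoTime() - start
    if (sink.isNaN) println(sink) // make sure sink is actually used
    elapsed.toDouble / runs
  }

  def main(args: Array[String]): Unit = {
    // Counts below are arbitrary assumptions, not tuned values.
    println(f"for:      ${avgNanos(20000, 200000)(withFor(100))}%.1f ns/call")
    println(f"foldLeft: ${avgNanos(20000, 200000)(withFold(100))}%.1f ns/call")
  }
}
```

Even with warm-up, a hand-rolled timer like this is easy to fool. For numbers you intend to rely on, a dedicated harness such as JMH is the safer choice.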

That said, you can look at the source to see exactly how these things are implemented. In particular, Range's foldLeft is defined on TraversableOnce, and looks like this:

def foldLeft[B](z: B)(op: (B, A) => B): B = {
  var result = z
  this foreach (x => result = op(result, x))
  result
}

As you can see, foldLeft just delegates to foreach, which is exactly what the for (...) syntax (without yield) desugars to.

In other words, they are actually doing the same thing.
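The equivalence can be sketched side by side. The object and method names below (Desugar, term, forLoop, forDesugared, foldByHand) are made up for illustration. The third method simply inlines the TraversableOnce definition quoted above, specialised to this sum.

```scala
object Desugar {
  // One term of the series from the question (illustrative helper).
  def term(i: Int): Double = 4.0 * (1 - (i % 2) * 2) / (2 * i + 1)

  // The loop as written in the question.
  def forLoop(n: Int): Double = {
    var acc = 0.0
    for (i <- 0 until n) acc += term(i)
    acc
  }

  // What a for loop without yield is rewritten to: a foreach call taking a closure.
  def forDesugared(n: Int): Double = {
    var acc = 0.0
    (0 until n).foreach(i => acc += term(i))
    acc
  }

  // foldLeft written out by hand, following the quoted definition.
  def foldByHand(n: Int): Double = {
    var result = 0.0
    (0 until n).foreach(x => result = result + term(x))
    result
  }
}
```

All three methods walk the range with foreach and add the terms in the same order, so they return the same value for a given n.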

The general rule is that you're probably not going to see a real performance difference on anything like this unless you're doing an enormous number of calculations. If it really is an issue, the fastest option is a plain while loop.
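For reference, a while-loop version of the same sum might look like the sketch below. WhileSum and leibniz are illustrative names, not anything from the question. It avoids both the Range and the closure by using a plain Int counter.

```scala
object WhileSum {
  // Sums the first n terms of the series from the question with a bare while loop.
  def leibniz(n: Int): Double = {
    var acc = 0.0
    var i = 0
    while (i < n) {
      acc += 4.0 * (1 - (i % 2) * 2) / (2 * i + 1)
      i += 1
    }
    acc
  }
}
```

Calling WhileSum.leibniz(100) computes the same value as the two snippets in the question, at the cost of the mutable counter that the functional version avoids.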
