With Scala, what's the (practical) limit number of Actors in a single process?

Question

I have a simulation code that creates N samples, and process it one by one. With IoC (Inversion of control), the processor method is given a method to be invoked. The number of samples are controlled from the configuration["iteration"] map.

This is the processor method:

def processor(configuration:Map[String, Any], f:(Int, Summary) => Unit) = {
    val byteWidth:Int = configuration.getOrElse("byteWidth", 4).asInstanceOf[Int]
    ...
    val iteration:Int = configuration.getOrElse("iteration", 10000).asInstanceOf[Int]

    val caller = self
    (1 to iteration).foreach { i =>
            ...
            val newBf = Summary.makeIt() // generate simulation data
            f(i, newBf)
        }
    }

}

This is the caller functions:

val conf = MMap[String, Any]()
conf("iteration") = 100000
def calculate(i:Int, bf: Summary) : Unit = {
  ... 
}
processor(configuration = conf.toMap, calculate)

This code works fine, but it does not use multi-core, so I modified the processor to use actor.

def parallelProcess(configuration:Map[String, Any], f:(Int, Summary) => Unit) = {
    ...
    val iteration:Int = configuration.getOrElse("iteration", 10000).asInstanceOf[Int]

    val caller = self
    (1 to iteration).foreach { i =>
      actor {
        caller ! {
            ...
            val newBf = Summary.makeIt() // generate simulation data
            f(i, newBf)
        }
      }
    }
    (1 to iteration).foreach { i =>
      receive {
        case msg => msg
      }
    }

}

This code works fine, and it uses all the cores that I have. Even when I create 100K samples (and accordingly actors), it works OK, but it slows down with 1 million actors, it slows down a bit, and 10 million samples it becomes very slow with all the cores are busy.

I expect that too may actors may be the culprit, even 100K actors seem pretty large number already, but works fine.

How many actors are too much actors? Is there way to control the number of actors for this kind of problem?

Lanny Ripple · Accepted Answer

An actor is a lightweight unit of control. It's basically a PartialFunction describing how to respond to a message and a mailbox containing messages. A central event loop takes a message out of the mailbox, sees if the actor will respond to it, and if so schedules the execution to happen on a thread. Threads are a bit heavier. They weigh in at about 1MB of memory per thread. Among perfectly CPU-bound processing you really only need 1 thread per core.

The question of how many actors are too many will depend on your application. You'd basically need to figure out how much cpu time per message you use on average and how many cores you have. If say 20ms per message with 8 cores you could provide a threadpool of size 8 and 400 actors and you should be maxing out all your cpu time. Just because you can provide more actors and/or more threads doesn't mean you'll get things done quicker. You'll reach a minimum and then start adding time to your runs as the overhead of managing the threads and actors starts to come into play.

You can modify Akka's configuration to control the threadpool size or you manage custom execution contexts. See the very fine docs for details.

With Scala, what's the (practical) limit number of Actors in a single process?

Answers (2)

Related Questions

With Scala, what&#39;s the (practical) limit number of Actors in a single process?

Answers (2)

Related Questions

With Scala, what's the (practical) limit number of Actors in a single process?