Behzad Pirvali

Reputation: 784

Storm fault tolerance: Nimbus reassigns worker to a different machine?

How do I make Storm's Nimbus restart a worker on the same machine?

To test fault tolerance, I run kill -9 on a worker process, expecting the worker to be restarted on the same machine. On one of the machines, however, Nimbus launches the worker on a different machine. The Nimbus log shows no repeated attempts, no errors, and nothing else unusual.

I would appreciate any help. Thanks!

Upvotes: 1

Views: 954

Answers (1)

Gordon Seidoh Worley

Reputation: 8088

You shouldn't need to. Workers should be able to move to an open slot on any supervisor. If you have a bolt that can't accommodate this because it reads data that lives on one particular supervisor, that is a design problem.

Additionally, Storm's fault tolerance is intended to handle not only worker failures but also supervisor failures, in which case the worker cannot be restarted on the same supervisor. You shouldn't need to worry about where a worker runs: that's a feature of Storm.
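
To make the point concrete, here is a rough toy model of the idea, not Storm's actual scheduler. The names `Supervisor`, `free_slots`, and `reassign` are made up for illustration, and the port numbers are just example values. It only shows that a worker gets whatever slot is free, and that when its supervisor is gone the slot has to come from another machine.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Slot = Tuple[str, int]  # (host, port); hypothetical representation


@dataclass
class Supervisor:
    """Toy stand-in for a supervisor node offering worker slots."""
    host: str
    ports: List[int]
    alive: bool = True


def free_slots(supervisors: List[Supervisor],
               assignments: Dict[str, Slot]) -> List[Slot]:
    """Return every unused slot on every supervisor that is still alive."""
    used = set(assignments.values())
    return [(s.host, p)
            for s in supervisors if s.alive
            for p in s.ports
            if (s.host, p) not in used]


def reassign(worker_id: str,
             supervisors: List[Supervisor],
             assignments: Dict[str, Slot]) -> Slot:
    """Drop the worker's old slot and give it the first free one, on any host."""
    assignments.pop(worker_id, None)
    slots = free_slots(supervisors, assignments)
    if not slots:
        raise RuntimeError("no free slots available")
    assignments[worker_id] = slots[0]
    return slots[0]


# Example: the worker's supervisor dies, so the worker must move elsewhere.
node_a = Supervisor("node-a", [6700, 6701])
node_b = Supervisor("node-b", [6700])
current = {"worker-1": ("node-a", 6700)}
node_a.alive = False
print(reassign("worker-1", [node_a, node_b], current))
```

In this model nothing ties a worker to a host, which is why a bolt that depends on data local to one machine breaks as soon as the worker is moved.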

Upvotes: 1
