Reputation: 588
How can I configure Flink so that the TaskManagers automatically restart after a failure?
Upvotes: 0
Views: 375
Reputation: 43454
On YARN and Kubernetes, Flink has a native resource manager (YarnResourceManager and KubernetesResourceManager) that will arrange for the requested number of slots to be available. In other environments you'll need to use cluster-framework-specific mechanisms to take care of this yourself.
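To illustrate what "arranging for the requested number of slots" amounts to, here is a rough conceptual sketch of the reconcile idea: keep a desired worker count and start a replacement whenever a worker is lost. This is a toy model, not Flink's code; `ToyResourceManager`, `reconcile`, `on_worker_failed`, and the `taskmanager-N` ids are all hypothetical names made up for the example.

```python
from dataclasses import dataclass, field
from itertools import count


@dataclass
class ToyResourceManager:
    """Toy model of an active resource manager (hypothetical, not Flink's implementation)."""
    desired_workers: int
    workers: set = field(default_factory=set)
    _ids: count = field(default_factory=lambda: count(1))

    def reconcile(self):
        """Start new workers until the desired count is reached; return the ids started."""
        started = []
        while len(self.workers) < self.desired_workers:
            worker_id = f"taskmanager-{next(self._ids)}"
            self.workers.add(worker_id)
            started.append(worker_id)
        return started

    def on_worker_failed(self, worker_id):
        """Forget the failed worker, then request a replacement."""
        self.workers.discard(worker_id)
        return self.reconcile()


rm = ToyResourceManager(desired_workers=3)
rm.reconcile()  # initial allocation of three workers
replacements = rm.on_worker_failed("taskmanager-2")
print(sorted(rm.workers), replacements)
```

In an environment without such an active resource manager, something outside Flink has to play this role, which is what "cluster-framework-specific mechanisms" refers to.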
Note that for k8s, only session clusters are supported by this new, more active mode implemented by KubernetesResourceManager. Job clusters still need to be managed in the old-fashioned way, as described in the docs.
And then there are managed Flink environments where these details are taken care of for you -- e.g., Ververica Platform or Kinesis Data Analytics.
Upvotes: 1