Kubernetes Deployment Rolling Updates

Question

I have an application that I deploy on Kubernetes.

This application has 4 replicas and I'm doing a rolling update on each deployment.

This application has a graceful shutdown which can take tens of minutes (it has to wait for running tasks to finish).

My problem is that during updates, I have over-capacity since all the older version pods are stuck at "Terminating" status while all the new pods are created.

During the updates, I end up running with 8 containers and it is something I'm trying to avoid.

I tried setting maxSurge to 0, but that setting doesn't take "Terminating" pods into account, so the load on my servers during the deployment is still too high.

The behaviour I'm trying to get is that a new pod is only created after an old-version pod has fully terminated, so that at no point do I exceed the number of replicas I set.

I wonder if there is a way to achieve such behaviour.

RamenOps · Accepted Answer

What I ended up doing was creating a StatefulSet with podManagementPolicy: Parallel and updateStrategy set to OnDelete.

I also set terminationGracePeriodSeconds to the maximum time it takes for a pod to terminate.
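
For illustration, the settings above could be combined in a manifest along these lines. This is only a sketch: the name task-worker, the app label, the image, the serviceName, and the 1800-second grace period are placeholders I made up, not values from my actual setup, so adjust them to your application.

```yaml
# Sketch only: names, labels, image, and the grace period are placeholders.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: task-worker
spec:
  serviceName: task-worker        # assumed name of the governing headless Service
  replicas: 4
  podManagementPolicy: Parallel   # create/delete pods without waiting on ordinal order
  updateStrategy:
    type: OnDelete                # pods pick up the new template only after being deleted
  selector:
    matchLabels:
      app: task-worker
  template:
    metadata:
      labels:
        app: task-worker
    spec:
      # Upper bound on how long a pod may take to finish its running tasks
      # (assumed here to be 30 minutes).
      terminationGracePeriodSeconds: 1800
      containers:
        - name: worker
          image: registry.example.com/task-worker:v2
```

With a manifest like this, the rollout would be kubectl apply -f on the updated file followed by kubectl delete pod -l app=task-worker. Adding --wait=false to the delete should keep kubectl from blocking while the pods drain.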

As a part of my deployment process, I apply the new StatefulSet with the new image and then delete all the running pods.

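This way all the pods enter the Terminating state at once, and whenever a pod finishes its tasks and terminates, a new pod with the new image replaces it. Because each StatefulSet pod keeps a fixed name, the replacement can't be created until the old pod is completely gone.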

This way I'm able to keep a static number of replicas during the whole deployment process.
