Why does the Dask client say my cluster has more cores and memory than the actual total available?

Question

I'm trying to understand the relationship among Kubernetes pods and the cores and memory of my cluster nodes when using Dask.

My current setup is as follows:

Kubernetes cluster using GCP's Kubernetes Engine
Helm package manager to install Dask on the cluster

Each node has 8 cores and 30 gb of ram. I have 5 nodes in my cluster:

I then scaled the number of pods to 50 by executing

kubectl scale --replicas 50 deployment/nuanced-armadillo-dask-worker

When I initialize the client in Dask using dask.distributed I see the following

What puzzles me is that the client says that there are 400 cores and 1.58 tb of memory in my cluster (see screenshot). I suspect that by default each pod is being allocated 8 cores and 30 gb of memory, but how is this possible given the constraints on the actual number of cores and memory in each node?

Why does the Dask client say my cluster has more cores and memory than the actual total available?

Answers (1)

Related Questions