Cloud Run, ideal vCPU and memory amount per instance?

Question

When setting up a cloud run, I am worried about how many memory and vCPU should be set each time per server instance.

I use Cloud Run for mobile apps.

I am confused about when to increase vCPU and memory instead of increasing server instances, and when to increase server instances instead of vCPU and memory.

How should I calculate it?

guillaume blaquiere · Accepted Answer

There isn't a good answer to that question. You have to know the limits:

The max number of concurrent requests that you can handle concurrently with 4cpu or/and 32Gb of memory (up to 1000 concurrent requests)
The max number on instance on Cloud Run (1000)

Then it's a matter of tradeoff, and it's highly dependent of your use case.

Bigger instances reduce the number of cold starts (and so high latency when your service scale up). But, if you have only 1 request at a time, you will pay a BIG instance for a small processing
Smaller instances allow you to optimize cost and to add only a small slice of resource in your cluster, but you will have to spawn often a new instance and you will have several cold start to endure.

Optimize what you prefer, find the right balance. No magic formula!!

Cloud Run, ideal vCPU and memory amount per instance?

Answers (2)

Related Questions