Add limits to requests (quotas)

Question

I have been looking at the different quotas for Vertex AI.

I have checked the "Quotas & system limits" page for Vertex AI, and there are thousands of quotas listed.

I am currently testing the Vertex AI SDKs, specifically with Gemini and other models. I am trying things like ChatPrompts, TextPrompts, etc.

E.g.: https://cloud.google.com/vertex-ai/docs/generative-ai/text/test-text-prompts

I would like to limit the number of API requests per minute and per day. Can someone help me understand which quotas I should limit on the "Quotas & system limits" page, given that there are thousands of them?
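To make the goal concrete, here is a minimal client-side sketch of a "requests per minute" cap. It is only an illustration of the behaviour being asked about, not a Vertex AI quota setting: `SlidingWindowLimiter`, `try_acquire`, and the example limit of 60 calls per 60 seconds are hypothetical names and values, and nothing in it calls the Vertex AI SDK. A cap like this only applies to the process that runs it, so it would not replace a project-level quota.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Hypothetical client-side cap: at most max_calls in any period_s-second window."""

    def __init__(self, max_calls, period_s, clock=time.monotonic):
        self.max_calls = max_calls
        self.period_s = period_s
        self.clock = clock      # injectable so the limiter can be tested without sleeping
        self.calls = deque()    # timestamps of accepted calls, oldest first

    def try_acquire(self):
        """Record the call and return True if under the cap, otherwise return False."""
        now = self.clock()
        # Forget calls that have aged out of the window.
        while self.calls and now - self.calls[0] >= self.period_s:
            self.calls.popleft()
        if len(self.calls) < self.max_calls:
            self.calls.append(now)
            return True
        return False


# Assumed example limit: 60 requests per minute.
per_minute = SlidingWindowLimiter(max_calls=60, period_s=60)

if per_minute.try_acquire():
    pass  # the model request would be sent here
else:
    pass  # over the cap: skip, queue, or retry later
```

The same class could be instantiated with `period_s=24 * 60 * 60` for a daily cap.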

Thanks

