What steps can I use to debug a GoogleGenerativeAI Error: Resource has been exhausted (e.g. check quota)

Question

I am using Google's NodeJS SDK for GoogleGenerativeAI. I am making completion requests:

  const modelObj = this._googleAiClient.getGenerativeModel({ model: 'gemini-1.5-flash' });
  const result = await modelObj.generateContent(prompt);

I am using an API key with billing enabled:

However, I'm consistently getting:

GoogleGenerativeAIFetchError: [GoogleGenerativeAI Error]: Error fetching from https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-flash:generateContent: [429 Too Many Requests] Resource has been exhausted (e.g. check quota).

If I visit the quotas page: https://console.cloud.google.com/apis/api/generativelanguage.googleapis.com/quotas

and search for gemini-1.5-flash, then the only limit I look remotely close to hitting is "Request limit per model per minute for a project in the free tier" -- however, as this is a Paid Plan, I wouldn't expect to be hitting that.

Does anyone know how I can debug this?

What steps can I use to debug a GoogleGenerativeAI Error: Resource has been exhausted (e.g. check quota)

Answers (1)

Related Questions