Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Constant resource exhaustion errors (429)

I am using the free plan with an average of 1.5 requests per minute and under 1500 tokens per request and I run into quota exceeded errors "google.api_core.exceptions.ResourceExhausted: 429 Resource has been exhausted (e.g. check quota)." 

ghofnar_0-1739445671248.png

I am also implementing time sleeps between the requests to avoid running into issues but the error persists. 

Anyone knows if this is a google issue? I see a number of people are facing the same issues as me

1 1 176
1 REPLY 1

Hi @ghofnar,

Welcome to Google Cloud Community!

I understand you're receiving quota exceeded errors (error 429) even though your usage appears to be within the free tier limits, based on the screenshot you provided.

With regard to the error that you received, if the number of your requests exceeds the capacity allocated to process requests, then error code 429 is returned. You may check this page for guidance on how to rectify this issue.

In addition, according to this documentation, Gemini 1.5 Flash and Gemini 2.0 Flash support Dynamic Shared Quota (DSQ) which eliminates the need to set quota limits and to submit quota increase requests (QIRs). If you need higher throughput, consider Google's Provisioned Throughput. Note that it is currently in Preview and access must be requested.

This similar case might offer some helpful ideas. You can also explore this site for insights on possible workarounds.

Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.