Hi community,
I am using the model gemini-experimental, and even though I do not exceed the quota limit (10), I am getting this error:
ClientError: 429 RESOURCE_EXHAUSTED. {'error': {'code': 429, 'message': 'Online prediction request quota exceeded for gemini-experimental. Please try again later with backoff.', 'status': 'RESOURCE_EXHAUSTED'}}
I checked the quota graph and the max usage is 7, so I am not sure why it says RESOURCE_EXHAUSTED.
Has anyone faced this?
Hi @luiscs-dj,
Welcome to Google Cloud Community!
It looks like you are getting a 429 RESOURCE_EXHAUSTED error while using the gemini-experimental model, even though you haven't reached your quota limit.
Here are some approaches that might help with your use case:
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.
@MarvinLlamas thank you for your help. I implemented the exponential backoff and it seems to be working.
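For anyone landing here later, below is a minimal sketch of the kind of exponential backoff mentioned above. It is not the exact code used in this thread. The names `call_with_backoff` and `is_quota_error` are hypothetical helpers, the delay values are arbitrary, and the check for "429" / "RESOURCE_EXHAUSTED" in the exception text is an assumption based on the error message quoted in the question.

```python
import random
import time


def call_with_backoff(fn, is_retryable, max_attempts=5, base_delay=1.0,
                      max_delay=32.0, sleep=time.sleep):
    """Call fn(); on a retryable error, wait with exponential backoff and retry."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception as exc:
            # Re-raise non-retryable errors, or the last error once attempts run out.
            if not is_retryable(exc) or attempt == max_attempts - 1:
                raise
            # The delay ceiling doubles each attempt (base, 2*base, 4*base, ...)
            # and is capped at max_delay.
            cap = min(max_delay, base_delay * (2 ** attempt))
            # Jitter: sleep a random time in [0, cap] so that parallel clients
            # do not all retry at the same moment.
            sleep(random.uniform(0, cap))


def is_quota_error(exc):
    # Assumption: the quota error can be recognised from the exception text,
    # as in the "429 RESOURCE_EXHAUSTED" message quoted in the question.
    text = str(exc)
    return "429" in text or "RESOURCE_EXHAUSTED" in text
```

You would wrap your own model call in it, for example `call_with_backoff(lambda: my_generate(prompt), is_quota_error)`, where `my_generate` stands in for whatever function sends the request. If your client library exposes a status code on the exception, checking that is likely more robust than matching on the message text.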