Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Why am I getting "ResourceExhausted: 429" error for Gemini-2.0-Flash-001 in batch prediction?

Hi everyone,

I am trying to submit a batch prediction job using the gemini-2.0-flash-001 model, but I keep getting the following error:

google.api_core.exceptions.ResourceExhausted: 429
The following quota metrics exceed quota limits:
aiplatform.googleapis.com/gemini_pro_concurrent_batch_prediction_jobs

I am not using gemini-pro, but rather gemini-2.0-flash-001, so I am unsure why this quota error is occurring.

I also checked the "Quotas" section in Google Cloud Console, but I couldn't find any quota related to aiplatform.googleapis.com/gemini_pro_concurrent_batch_prediction_jobs.

Could this be related to my project’s quota limits? If so, is there a way to check and increase the allowed concurrent batch prediction jobs?

Any insights or solutions would be greatly appreciated!

Thanks in advance!

0 2 173
2 REPLIES 2