Hi everyone,
I am trying to submit a batch prediction job using the gemini-2.0-flash-001 model, but I keep getting the following error:
google.api_core.exceptions.ResourceExhausted: 429
The following quota metrics exceed quota limits:
aiplatform.googleapis.com/gemini_pro_concurrent_batch_prediction_jobs
I am not using gemini-pro, but rather gemini-2.0-flash-001, so I am unsure why this quota error is occurring.
I also checked the "Quotas" section in Google Cloud Console, but I couldn't find any quota related to aiplatform.googleapis.com/gemini_pro_concurrent_batch_prediction_jobs.
Could this be related to my project’s quota limits? If so, is there a way to check and increase the allowed concurrent batch prediction jobs?
Any insights or solutions would be greatly appreciated!
Thanks in advance!
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |