Hello Google Cloud Community!
I'm trying to implement a simple video generation interface in Google Colab using Google's Veo 2.0 model via the Vertex AI REST API, but I've hit a puzzling roadblock.
According to my Google Cloud Console, I have a quota of 10 requests per minute for the veo-2.0-generate-001 model in us-central1, and my current usage shows 0%. However, every single API request fails with a 429 error:
Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: veo-2.0-generate-001. Please submit a quota increase request.
My implementation:
I'm particularly confused because:
Has anyone else experienced this discrepancy between reported quota usage and actual API behavior with newer generative models like Veo? Is there perhaps a hidden quota limit or some initialization step I'm missing?
Any guidance would be greatly appreciated!
Thanks in advance,
-J
Hi no_notes,
Welcome to the Google Cloud Community!
The 429 RESOURCE_EXHAUSTED error you're encountering with Veo 2.0, even with the console showing 0% quota usage, is a known issue with newer models on Vertex AI. Here are some approaches that might help to resolve your issue:
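As a general pattern for 429 responses, here is a minimal retry-with-backoff sketch. It is illustrative only, not an official fix: `call_with_backoff` and `send_request` are hypothetical names, and `send_request` stands in for whatever function issues your Vertex AI REST call and returns a `(status_code, body)` pair. Retrying will likely not help if the effective quota for your project is lower than what the console displays.

```python
import random
import time


def call_with_backoff(send_request, max_attempts=5, base_delay=2.0, sleep=time.sleep):
    """Call send_request() until it stops returning HTTP 429.

    send_request is a hypothetical zero-argument callable returning
    (status_code, body); wrap your own REST call in it.
    """
    for attempt in range(max_attempts):
        status, body = send_request()
        if status != 429:
            # Success or a non-quota error: hand it back to the caller.
            return status, body
        if attempt < max_attempts - 1:
            # Exponential backoff with up to 1s of jitter: ~2s, ~4s, ~8s, ...
            delay = base_delay * (2 ** attempt) + random.uniform(0, 1)
            sleep(delay)
    # Still 429 after the final attempt; return the last response.
    return status, body
```

If every attempt still returns 429 while the console reports 0% usage, that points toward a quota or allowlist problem on the project rather than a burst of traffic.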
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.