Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

2.0 API Quota Error Despite 0% Usage in Console

Hello Google Cloud Community!

I'm trying to implement a simple video generation interface in Google Colab using Google's Veo 2.0 model via the Vertex AI REST API, but I've hit a puzzling roadblock.

According to my Google Cloud Console, I have a quota of 10 requests per minute for the veo-2.0-generate-001model in us-central1, and my current usage shows 0%. However, every single API request fails with a 429 error:

 
Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: veo-2.0-generate-001. Please submit a quota increase request.

My implementation:

I'm particularly confused because:

  1. The console shows 0% quota usage
  2. I haven't successfully made a single request yet
  3. I get the quota error immediately on the first attempt

Has anyone else experienced this discrepancy between reported quota usage and actual API behavior with newer generative models like Veo? Is there perhaps a hidden quota limit or some initialization step I'm missing?

Any guidance would be greatly appreciated!

Thanks in advance,
-J

1 1 1,157
1 REPLY 1