Vertex AI API quotas. Documentation discrepancy.

Good afternoon,

According to the documentation at https://cloud.google.com/vertex-ai/generative-ai/docs/quotas#quotas_by_region_and_model, the base model gemini-1.0-pro in the Iowa region (us-central1) should have a quota of 300 requests per minute (RPM). In practice, however, my actual limit is set to 5 RPM.

I thought the cause might be my free account, so I upgraded to a paid account, but nothing changed. Following a recommendation, I then made a manual payment, hoping this would bring the quota in line with the documentation, but again nothing changed.

What could be the issue, and whom should I contact? The sales team said they couldn't help and advised me to contact support, but a support subscription is only available if you have an organization.

Screenshot 2024-05-02 at 10.43.47.png
