Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

"Exceeded" quota when using Anthropic models

Kris49120_0-1726151615298.png

I seem to have 2 quotas at the same time. One unlimited and one at "0" with exactly the same data. When I send a request to generate a response I get the following error:



Quota
exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: anthropic-claude-3-5-sonnet. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.","status":"RESOURCE_EXHAUSTED"

It seems that the quota that is set to "0" is used instead of the "Unlimited" one. It should be noted that I have the account verified with a valid payment method so in theory I shouldn't have any problems.

Also when I have tried to edit the quota from "0" it only lets me keep it at the same number.

3 REPLIES 3

same here. I am trying to fix this as well. so far no luck

same issue here. The quota for anthropic-claude-3-5-sonnet is fixed at 0, and it’s not possible to request a quota exceeding 0. The system message suggesting to contact the Sales Team is of no use, as they do not respond.

Even though I have paid, I cannot use the service.

Precisely the same issue here. Two quotas, one "unlimited" and the other "0", for the same region/model.(us-east5, anthropic-claude-3-5-sonnet)