I seem to have 2 quotas at the same time. One unlimited and one at "0" with exactly the same data. When I send a request to generate a response I get the following error:
Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: anthropic-claude-3-5-sonnet. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.","status":"RESOURCE_EXHAUSTED"
It seems that the quota that is set to "0" is used instead of the "Unlimited" one. It should be noted that I have the account verified with a valid payment method so in theory I shouldn't have any problems.
Also when I have tried to edit the quota from "0" it only lets me keep it at the same number.
same here. I am trying to fix this as well. so far no luck
same issue here. The quota for anthropic-claude-3-5-sonnet is fixed at 0, and it’s not possible to request a quota exceeding 0. The system message suggesting to contact the Sales Team is of no use, as they do not respond.
Even though I have paid, I cannot use the service.
Precisely the same issue here. Two quotas, one "unlimited" and the other "0", for the same region/model.(us-east5, anthropic-claude-3-5-sonnet)
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |