I seem to have 2 quotas at the same time. One unlimited and one at "0" with exactly the same data. When I send a request to generate a response I get the following error:
Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: anthropic-claude-3-5-sonnet. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.","status":"RESOURCE_EXHAUSTED"
It seems that the quota that is set to "0" is used instead of the "Unlimited" one. It should be noted that I have the account verified with a valid payment method so in theory I shouldn't have any problems.
Also when I have tried to edit the quota from "0" it only lets me keep it at the same number.
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |