Hello everyone,
I’m encountering a persistent issue with the Claude-3-haiku API. Despite having a quota of 10 requests per minute, I’m frequently receiving the following error message:
[
{
"error": {
"code": 429,
"message": "Quota exceeded for aiplatform.googleapis.com/online_prediction_tokens_per_minute_per_base_model with base model: anthropic-claude-3-haiku. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.",
"status": "RESOURCE_EXHAUSTED"
}
}
]
This error indicates that I’ve exceeded the quota, but according to my current configuration, I should be able to make 10 requests per minute. I even applied for and received an upgrade to 20 requests per minute, but the issue persists.
Does anyone have any insights or suggestions on how to resolve this? Any help would be greatly appreciated.
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |