Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Regarding my “online_prediction_requests_per_base_model” when using the Claude model problem

 

{
  "error": {
    "code": 429,
    "message": "Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: anthropic-claude-3-5-sonnet. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.",
    "status": "RESOURCE_EXHAUSTED"
  }
}

 

I encountered the above problem when trying to use the model `claude-3-5-20240620` and I confirmed that my credits are not fully utilized yet
I've seen some similar posts about this and the problem I'm currently experiencing is almost identical to theirs, my `online_prediction_requests_per_base_model` limit has also gone to `0` and can't be adjusted, can you help me readjust this limit please?
https://www.googlecloudcommunity.com/gc/AI-ML/Receiving-quota-error-when-trying-to-use-bison-chat-mo...

6 32 3,813
32 REPLIES 32