
Llama 2 in Vertex AI not working

I deployed Llama 2 13B and 70B in Vertex AI through Model Garden. The deployment succeeded, but when I call the endpoint with curl I keep getting the error below. Has anyone tried Llama 2 in Vertex AI?

{
  "error": {
    "code": 503,
    "message": "Took too long to respond when processing endpoint_id: {endpoint_id}, deployed_model_id: {deployed_model_id}",
    "status": "UNAVAILABLE"
  }
}
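
For anyone trying to reproduce this, here is a minimal Python sketch of an equivalent call to the Vertex AI REST `:predict` URL, with a retry on 503. It is not the exact curl command from this post. The helper names (`predict_url`, `post_json`, `predict_with_retry`), the retry settings, and the `prompt` / `max_tokens` instance fields in the usage note are assumptions for illustration; check the model card for the real request schema.

```python
import json
import time
import urllib.error
import urllib.request


def predict_url(project: str, region: str, endpoint_id: str) -> str:
    """Build the REST predict URL for a deployed Vertex AI endpoint."""
    return (
        f"https://{region}-aiplatform.googleapis.com/v1/projects/{project}"
        f"/locations/{region}/endpoints/{endpoint_id}:predict"
    )


def post_json(url, token, payload, timeout):
    """POST a JSON payload and return (HTTP status, response body as text)."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status, resp.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError as err:
        # Error responses (such as the 503 above) still carry a body.
        return err.code, err.read().decode("utf-8", errors="replace")


def predict_with_retry(url, token, payload, send=post_json, attempts=3,
                       backoff_s=5.0, timeout=120.0, sleep=time.sleep):
    """Call the endpoint, retrying with exponential backoff on HTTP 503.

    Hypothetical helper: the attempt count and backoff are arbitrary choices.
    """
    status, body = None, None
    for attempt in range(attempts):
        status, body = send(url, token, payload, timeout)
        if status != 503:
            return status, body
        if attempt < attempts - 1:
            sleep(backoff_s * 2 ** attempt)
    return status, body
```

A call would look roughly like `predict_with_retry(predict_url(PROJECT, REGION, ENDPOINT_ID), TOKEN, {"instances": [{"prompt": "Hello", "max_tokens": 32}]})`, where `TOKEN` comes from `gcloud auth print-access-token`. Retrying only helps if the 503 is transient; if every request exceeds the serving deadline, the retries will fail the same way.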