Hi,
I'm stuck at following error message when I try to deploy custom model to vertex-ai endpoint.
Command:
gcloud ai endpoints deploy-model {ENDPOINT_ID}\
--region={REGION} \
--model={MODEL_ID} \
--display-name={DEPLOYED_MODEL_NAME} \
--machine-type=n1-standard-2 \
--enable-access-logging \
--enable-container-logging \
--min-replica-count=1 \
--max-replica-count=100 \
--traffic-split=0=100
You may check here some things you can check further when deploying your model. If this does not work, it would be helpful to file a 1:1 support case since they can check your internal resources.
Try removing the "--traffic-split" argument, it should be an array.
--traffic-split=[blue=50, green=50]"
Or add "--deployed-model-id" with a hardcoded value (not "0").