I am trying to deploy the Llama 2 7B model on Vertex AI, but I keep getting this error message: "The following quotas are exceeded: CustomModelServingL4GPUsPerProjectPerRegion"
You need to request a quota increase that covers these GPUs:
L4 GPUs
Make sure you select the correct region, and that the quota you request is for inference (serving), not training.
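In case it helps, here is a rough sketch of how you might check the relevant quota from the command line before filing the increase. The exact metric name is an assumption on my part (derived from the quota ID in the error), and `PROJECT_ID` / the region are placeholders, so adjust as needed:

```shell
# Placeholder: replace PROJECT_ID with your actual project ID.
# List Vertex AI quotas for the project (alpha gcloud surface; availability
# may depend on your SDK version).
gcloud alpha services quota list \
  --service=aiplatform.googleapis.com \
  --consumer=projects/PROJECT_ID \
  --filter="custom_model_serving_nvidia_l4_gpus"

# The increase itself is requested in the Cloud Console:
# IAM & Admin -> Quotas -> filter for
# "Custom model serving Nvidia L4 GPUs per region" -> select your
# deployment region -> Edit Quotas -> submit the request.
```

The console route is usually the easier one; approval can take anywhere from minutes to a couple of business days depending on the requested amount.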
@Mustafa_21 But can you tell me how to request a quota increase for these GPUs?
@cloudy_watcher Were you able to figure this out? I'm getting the same error.