We are running GPU-based online predictions on our AI Platform.
We are applying auto-scaling, but wanted to improve the machine type because the CPU load is higher than the GPU.
The original machine type we were using was n1-standard-4 and we tried to raise it from there to n1-standard-8 or higher, but we got an error on versions create.
The details of the error are below.
```
"error": {
"code": 429,
"message": "The requested number of n1-standard-8 exceeds the quota limit. Current usage/limit: 0/20, Requested: 24.",
"status": "RESOURCE_EXHAUSTED"
}
```
I thought I could apply to increase the quota, but I don't know which quota it corresponds to.
Otherwise, I tried the following
1. if minNodes=1 in n1-standard-8, I was able to deploy, but could not find the resource that was being used more. 2.
2. we were able to create a new n1-standard-16 (with GPU) in Compute Engine.
Good day @omonao,
I highly recommend that you reach out to Google Cloud Support for this inquiry, so that they can investigate your project quotas and project resources. You can use this link: https://cloud.google.com/support
Thank you for being part of Google Cloud Community
Hi, @kvandres ,
Our project is currently only on basic support, do we have to upgrade our support and inquire?
Thanks!
Hi @omonao,
The error message you received indicates that the requested number of n1-standard-8 machine types exceeds the quota limit. The error message specifically states that the current usage/limit is 0/20, and you requested 24 instances.
To resolve this issue, you need to request a quota increase for the n1-standard-8 machine type. Here's how you can do it:
Google Cloud support will review your request and respond accordingly. They will let you know if the quota increase is approved or if there are any additional steps required.
In the meantime, you have a couple of alternatives:
Remember to monitor your usage and resources to ensure they align with the quota limits and requirements of your project.
Hope this helps!
Hi, @Roderick
Thank you for replying!
I tried to investigate on the quota page, but could not find the target quota by searching for "n1-standard-8" in the Metric field.
The same is true when I search for "n1", or try in a different project.
Is there a problem with the project settings or something else?
Thank you for your help.
Hi Omonao,
I am having this same issue. Were you able to get this resolved?
Same issue here.
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |