Cannot increase machine type on AI Platform Prediction

We are running GPU-based online predictions on AI Platform Prediction.
Auto-scaling is enabled, but we want to move to a larger machine type because the CPU load is higher than the GPU load.

We were originally using n1-standard-4 and tried to move up to n1-standard-8 or higher, but `versions create` failed.
The details of the error are below.
```
"error": {
"code": 429,
"message": "The requested number of n1-standard-8 exceeds the quota limit. Current usage/limit: 0/20, Requested: 24.",
"status": "RESOURCE_EXHAUSTED"
}
```
I would like to request a quota increase, but I don't know which quota this error corresponds to.
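
To make the numbers in the error easier to compare across attempts, here is a minimal Python sketch that pulls the resource name, usage, limit, and requested amount out of the message text. The regular expression is only an assumption based on the single message shown above, not a documented format, and `parse_quota_error` and `over_by` are hypothetical names used for illustration. The message does not say what unit the numbers are in, so the sketch does not interpret them.

```python
import re

# Message text copied from the error above.
MESSAGE = (
    "The requested number of n1-standard-8 exceeds the quota limit. "
    "Current usage/limit: 0/20, Requested: 24."
)

# Pattern inferred from this one message; other errors may be worded differently.
PATTERN = re.compile(
    r"requested number of (?P<resource>\S+) exceeds the quota limit\. "
    r"Current usage/limit: (?P<usage>\d+)/(?P<limit>\d+), "
    r"Requested: (?P<requested>\d+)"
)


def parse_quota_error(message):
    """Return the quota figures from the message, or None if it does not match."""
    match = PATTERN.search(message)
    if match is None:
        return None
    info = match.groupdict()
    for key in ("usage", "limit", "requested"):
        info[key] = int(info[key])
    # How far the request would exceed the limit, in the message's own units.
    info["over_by"] = info["usage"] + info["requested"] - info["limit"]
    return info


print(parse_quota_error(MESSAGE))
```

For the message above, the request of 24 against a limit of 20 with 0 in use overshoots by 4.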

I also tried the following:
1. With minNodes=1 on n1-standard-8, the deployment succeeded, but I could not find which quota's usage had increased.
2. We were able to create a new n1-standard-16 instance (with GPU) in Compute Engine.