I'm attempting to run a basic pipeline using Kubeflow in Vertex AI. However, when I run it I receive a RESOURCE_EXHAUSTED error in the logs relating to aiplatform.googleapis.com/custom_model_training_cpus.
Checking my quotas, I can see that I am not anywhere close to exhausting any of the quotas under aiplatform.googleapis.com/custom_model_training_cpus (including the region I'm using - us-central1).
Has anyone had a similar issue and know what is going on here?
And here's the exact error:
Resource exhausted usually refers to the amount of memory used while running the code, not exactly the region quotas. Did you define the machine type in Kubeflow training and deployment, like?