Hi all,
I have been trying to create very small and simple pipelines with only a couple of components with simple Python functions but they seem to all fail with the following error:
"com.google.cloud.ai.platform.common.errors.AiPlatformException: code=RESOURCE_EXHAUSTED, message=The following quota metrics exceed quota limits: aiplatform.googleapis.com/custom_model_training_cpus, cause=null; Failed to handle the pipeline task"
even when directly copying simple pipeline code from the GCP website documentation it fails and gives the same error as above. I have also seen my quotas and limits too and it seems no limits or resources are being exhausted and I assumed this as the pipelines are so small and simple.
I am also getting the same issue when trying to train the most simple ML model within Vertex AI.
Any ideas on why this is happening?
Thank you.
Hi @kaz07,
Welcome to Google Cloud Community!
The "RESOURCE_EXHAUSTED" error, which specifically mentions aiplatform.googleapis.com/custom_model_training_cpus, indicates that your Vertex AI project is being restricted due to requesting more CPU resources than your allocated quota allows. Even if your pipelines seem small, the default configuration might be requesting more CPUs than your project's quota permits, particularly in a new project or a region with high demand. Quotas are in place to ensure fair usage and to prevent any single user from overloading the system. Regarding this, here are some possible approaches you can consider to address the issue:
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.