First-time user of the Vertex AI AutoML training pipeline. I've made multiple attempts at training on a fairly simple dataset (fewer than 10K observations, no missing data) using tabular regression, and it always fails immediately with the error "Training pipeline failed with error message: Internal error occurred. Please retry in a few minutes." I've tried both MAE and RMSE as the optimization objective. Details of one training session below. What am I doing wrong?
This is also happening to me, and I am confused because the dataset is very simple.
The logs aren't very helpful either.
The most informative log I found was: "The DAG failed because some tasks failed. The failed tasks are: [tabular-stats-and-example-gen]."
The logs for tabular-stats-and-example-gen then say:
error.code: 13
error.message: INTERNAL
Have you been able to successfully train the model?
I have the same problem here. I've already wasted some hours checking the dataset and so on, and it seems to be a pipeline problem, not mine 😞
I am getting the same in my recently created project, but I have an older project where everything just works, even with the same data.
No, I haven't trained it yet - that's the step where it's failing. I've read elsewhere that AutoML may have issues with Excel CSV files, which is what I uploaded. I'm going to try saving it as a Google CSV and try that.
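In case it helps anyone trying the same thing: here is a minimal sketch for re-saving a CSV as plain UTF-8 without a byte-order mark and with LF line endings, using only the Python standard library. Whether the Excel export is actually what triggers the internal error is unconfirmed. The function name `normalize_csv` and the file paths are made up for illustration, and the sketch assumes the input is already UTF-8 (with or without a BOM); a file saved in another encoding would need a different `encoding` argument when reading.

```python
import csv


def normalize_csv(src_path, dst_path):
    """Re-save a CSV as UTF-8 without a BOM, using LF line endings.

    Hypothetical helper for illustration; assumes the input is UTF-8.
    """
    # "utf-8-sig" drops a leading byte-order mark if one is present.
    # newline="" lets the csv module handle CRLF/LF line endings itself.
    with open(src_path, newline="", encoding="utf-8-sig") as src, \
            open(dst_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.writer(dst, lineterminator="\n")
        for row in csv.reader(src):
            writer.writerow(row)


if __name__ == "__main__":
    # Placeholder file names; replace with your own paths.
    normalize_csv("training_data_excel.csv", "training_data_clean.csv")
```

The cleaned file can then be uploaded in place of the original to see whether the tabular-stats-and-example-gen step gets any further.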
For me, it was fixed by granting the requested roles to the service account used when creating a pipeline run for training the model.
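For anyone who prefers the command line over the console, here is a rough sketch that prints one `gcloud projects add-iam-policy-binding` command per role. The project ID, service account email, and the single role in the example are placeholders, not the actual list; use whatever roles the console asks for when you create the pipeline run. The helper name `binding_commands` is made up for illustration.

```python
import shlex


def binding_commands(project_id, service_account_email, roles):
    """Build one gcloud IAM binding command per role (hypothetical helper)."""
    commands = []
    for role in roles:
        args = [
            "gcloud", "projects", "add-iam-policy-binding", project_id,
            f"--member=serviceAccount:{service_account_email}",
            f"--role={role}",
        ]
        # shlex.join quotes arguments safely for pasting into a shell.
        commands.append(shlex.join(args))
    return commands


if __name__ == "__main__":
    # All values below are placeholders; the role list is illustrative only.
    for cmd in binding_commands(
        "my-project",
        "my-sa@my-project.iam.gserviceaccount.com",
        ["roles/aiplatform.user"],
    ):
        print(cmd)
```

The script only prints the commands, so you can review them before running anything against your project.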
I've granted these roles but am still getting the error.
I am having the same issue here too