@dawnberdan @MJane
My Vertex AI tuning job for Gemini 2.0 has been stuck at "running" for 16 hours with a small dataset (30 entries). It was working fine before, but now it's stuck. The default quota is 1 job, I guess, and in my case only 1 is running.
I've tried cancelling and restarting, but it had no effect.
The dataset is already very simple.
All parameters are left at their defaults.
2) How do I view the logs?
I am also having the same issue. It's not clear what is causing it. No logs whatsoever. Strange. I will post if I find any workaround.
Thanks. Let me know when it's resolved for you.
Hi @evo-stage,
Welcome to the Google Cloud Community!
It looks like you are encountering an issue where your Vertex AI Gemini 2.0 tuning job remains indefinitely stuck in the "Validating dataset" stage, even though the dataset is quite small, and the root cause is difficult to pinpoint due to a lack of access to the job logs.
Here are the potential ways that might help with your use case:
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.
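Since the job reportedly hangs at the "Validating dataset" stage, one thing that can be ruled out locally is a malformed training file. The sketch below is only a rough sanity check, and it assumes the dataset is a JSONL file in the Gemini chat-style layout (one JSON object per line with a `contents` list of turns, each turn having a `role` of `user` or `model` and a non-empty `parts` list). The thread does not say which format was actually used, and the function name `validate_tuning_jsonl` is made up for illustration. Passing this check does not guarantee that the service-side validation will pass.

```python
import json

# Assumption: roles allowed in Gemini chat-style tuning examples.
ALLOWED_ROLES = {"user", "model"}


def validate_tuning_jsonl(lines):
    """Return a list of (line_number, message) problems found in the given lines.

    `lines` is any iterable of strings, for example an open file handle.
    An empty result means this basic check found nothing wrong.
    """
    problems = []
    for number, raw in enumerate(lines, start=1):
        if not raw.strip():
            problems.append((number, "blank line"))
            continue
        try:
            record = json.loads(raw)
        except json.JSONDecodeError as exc:
            problems.append((number, f"invalid JSON: {exc.msg}"))
            continue
        contents = record.get("contents") if isinstance(record, dict) else None
        if not isinstance(contents, list) or not contents:
            problems.append((number, "missing or empty 'contents' list"))
            continue
        for turn in contents:
            if not isinstance(turn, dict):
                problems.append((number, "turn is not a JSON object"))
                continue
            if turn.get("role") not in ALLOWED_ROLES:
                problems.append((number, f"unexpected role: {turn.get('role')!r}"))
            parts = turn.get("parts")
            if not isinstance(parts, list) or not parts:
                problems.append((number, "turn has no 'parts'"))
    return problems


# Example usage with a local copy of the training file (path is hypothetical):
# with open("train.jsonl", encoding="utf-8") as handle:
#     for number, message in validate_tuning_jsonl(handle):
#         print(f"line {number}: {message}")
```

With only 30 entries this runs instantly, so it is a cheap way to separate "the file is broken" from "the service is stuck".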
@MarvinLlamas I've checked the status; there are no service outages. There are no issues with permissions either, as it was working perfectly fine until the 16th of June.
I've restarted it multiple times and tried creating a new job, each with a different name by adding a timestamp.
I've tried to check the Cloud logs, but they are too difficult to read and I can't tell where to find errors. I can't see any errors in the logs, and I couldn't find any way to filter by job. It's been almost 36 hours now.
Bucket location is: US (multiple regions in United States), while tuning is in US Central.
@MarvinLlamas I've also checked the logs for my tuning job ID. There's only one log entry, from when the tuning job started, and the state there is pending.
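On the question of filtering logs by job: the thread does not establish which log resource type tuning jobs write to, so the sketch below avoids guessing one. It relies only on general features of the Cloud Logging query language: a bare quoted string matches entries containing that text in any field, and `severity` and `timestamp` support comparison operators. The helper name `build_log_filter` and the job ID in the example are hypothetical.

```python
from datetime import datetime, timezone


def build_log_filter(job_id, since, min_severity="WARNING"):
    """Build a Cloud Logging filter string for entries that mention a job ID.

    job_id:       the tuning job ID to search for as free text.
    since:        a timezone-aware datetime; older entries are excluded.
    min_severity: lowest severity to include, e.g. "DEFAULT", "INFO", "WARNING", "ERROR".
    """
    # Cloud Logging expects RFC 3339 timestamps; normalise to UTC first.
    stamp = since.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return " AND ".join(
        [
            f'"{job_id}"',  # free-text match across all fields
            f"severity>={min_severity}",
            f'timestamp>="{stamp}"',
        ]
    )


# Placeholder job ID and date, for illustration only.
print(build_log_filter("1234567890", datetime(2024, 1, 1, tzinfo=timezone.utc)))
# prints: "1234567890" AND severity>=WARNING AND timestamp>="2024-01-01T00:00:00Z"
```

The resulting string can be pasted into the Logs Explorer query box or passed to `gcloud logging read`. Setting `min_severity="DEFAULT"` shows every entry that mentions the job, which is useful when, as here, the only entry found is the initial "pending" one and the goal is to confirm nothing else was logged.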