i subscribed Google Colab Pro+ to train a neural network (seq2seq) for nlp project but after almost an hour when I started the training the runtime has changed to connecting---- and at the bottom the message "Waiting to finish the current execution" has appeared and now the model has been training for more than 6 hours and more than that time is left to accomplish the training also I cannot see the resource that I used in the runtime. so my question is what is the solution for this kind of problem and will I lost all the variables that worked on so far in this session particularly the variable that stores the training history?
Hello @Aways ,
Welcome to Google Cloud Community!
There are several potential reasons for the issue you're encountering with your Google Colab Pro+ runtime:
Solutions:
Regarding your variables:
By addressing these potential causes and following the suggested solutions, you should be able to resolve the issue and successfully train your neural network on Google Colab Pro+.
I hope the above information is helpful.
Thanks, McMaco.
fortunately, my session reconnected to the runtime and finished the training although it took a long time (16 hours). but again my session crashed at the inference phase and it automatically restarted although I tried to test the saved model multiple times I wasn't able to because of the constant session crash. so please any solution.
Thanks
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |