Bronze 5
Since ‎08-03-2023
2 weeks ago

My Stats

  • 15 Posts
  • 0 Solutions
  • 8 Likes given
  • 20 Likes received

r4ruixi's Bio

Badges r4ruixi Earned

View all badges

Recent Activity

We repetitively ran into a CODE_GCE_ERROR when executing a Batch job. There is no error log. The only status change showed the following information:VM in Managed Instance Group meets error: Batch Error: code - CODE_GCE_ERROR, description - error cou...
- copy_json_to_serving_bucket: call: googleapis.storage.v1.objects.copy args: destinationBucket: "target-bucket" destinationObject: "test_data/test.json" sourceBucket: "source-bucket" sourceObject: "test_data/test.json" result: copy_result I ran into...
I am trying to leverage `torchrun` to run GPU computing with GCP Batch. However, it requires a nvidia-container toolkit: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.htmlI cannot find any interface to pre-ins...
I updated the hardware configuration of one of my existing user-managed vertex AI notebooks, and then restarted it. The VM is up but the proxy of jupyter notebook is stuck at "Setting up proxy to JupyterLab". Is there any solution for this issue? I c...
When executing jobs on GCP Batch, especially those jobs using GPUs, we noticed that there are many log messages are marked as "error" but they're actually normal installation logs of GPU drivers. The same issue happened to docker image layer download...