Hi,
I have a job that runs fairly consistently using Docker container runnables on c2-standard-16 instances, where I specify both the instance type and the compute resources. When we set the computeResource to 4 vCPUs (cpuMilli) and 16 GB (memoryMib), the job completes successfully most of the time. When we increase the computeResource to 8 vCPUs and 32 GB of RAM, the job becomes extremely slow: runtime goes from roughly 2 min to upwards of 30 min, and some tasks fail with an unresponsive instance or an OOM error. Our quotas show that we're nowhere near any limits. In the Batch agent logs for the runs with increased CPU/RAM, I see many messages like the following:
rpc error: code = Unavailable desc = 502:Bad Gateway. Retrying in 3.09654082s
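For anyone trying to reproduce this, here is a rough sketch of the two configurations described above, written as a plain Python dict in the shape of a Batch job body. The image URI is a placeholder, the helper names are made up for illustration, and the machine shape (16 vCPUs, 64 GB, treated here as 65536 MiB) is taken from the published c2-standard-16 specs. The per-VM figure is only the arithmetic upper bound from the requested resources, not a statement about how Batch actually placed the tasks in this case.

```python
import json

# Assumed machine shape for c2-standard-16: 16 vCPUs, 64 GB (treated as GiB).
MACHINE_TYPE = "c2-standard-16"
MACHINE_VCPUS = 16
MACHINE_MEMORY_MIB = 64 * 1024


def compute_resource(vcpus: int, memory_gib: int) -> dict:
    """Convert vCPUs and GiB into the units computeResource uses."""
    return {"cpuMilli": vcpus * 1000, "memoryMib": memory_gib * 1024}


def tasks_per_vm(resource: dict) -> int:
    """Upper bound on tasks per VM from the CPU and memory requests alone."""
    by_cpu = (MACHINE_VCPUS * 1000) // resource["cpuMilli"]
    by_mem = MACHINE_MEMORY_MIB // resource["memoryMib"]
    return min(by_cpu, by_mem)


def job_config(resource: dict, image_uri: str) -> dict:
    """Build a minimal job body with one container runnable."""
    return {
        "taskGroups": [
            {
                "taskSpec": {
                    "runnables": [{"container": {"imageUri": image_uri}}],
                    "computeResource": resource,
                }
            }
        ],
        "allocationPolicy": {
            "instances": [{"policy": {"machineType": MACHINE_TYPE}}]
        },
    }


if __name__ == "__main__":
    # Placeholder image; substitute the real container image.
    image = "gcr.io/example-project/example-image:latest"
    for vcpus, gib in [(4, 16), (8, 32)]:
        res = compute_resource(vcpus, gib)
        print(f"{vcpus} vCPU / {gib} GiB: at most {tasks_per_vm(res)} tasks per VM")
        print(json.dumps(job_config(res, image), indent=2))
```

Note that in both cases the requests add up to the full nominal memory of the VM, so neither leaves headroom for the OS or the agent on paper. That may or may not be relevant to the slowdown reported here.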
@jfishbein Thanks for reporting the issue. We believe this is related to a recent bug, and the fix is being rolled out. If everything goes as planned, the issue should be fixed within one or two days. Sorry for the inconvenience; please try again later.