Hello,
For the past month, I have been observing the behavior of the Cloud Run serverless scalability. When the total number of API requests exceeds the number of concurrent requests, Cloud Run takes too much time to create a new instance. Consequently, some API requests timeout, and there is a delay in redirecting the exceeded requests to the new instance. Although my API request timeout is set to 30 seconds, the API continues to return a 504 upstream timeout error. Please let me know if there are any settings missing in the Cloud Run configuration. My maximum concurrency request limit is set to 8, and I am running Node.js code on my server, which is single-threaded.
I appreciate your assistance in advance.
Thank you.
Hi @Accredian,
Welcome to Google Cloud Community!
These might help you:
Hi @Accredian, did you figure out the optimal options for fast creation of instances?