I am new to GCP and I want to deploy my flask app that basically contacts via webhooks to WhatsApp api and then answers the customer queries on WhatsApp using Gemini api and rag. The app is around 500mb when all the libraries in the requirements.txt are included. The app is going to have around 10000 requests per month to reply to WhatsApp query, any idea what configuration should I use and which service should I use. Thanks in advance!
Note: The app does not use GPU
Solved! Go to Solution.
Hello,
Thank you for contacting Google Cloud Community !
Based on your description, App Engine (Flexible Environment) or Cloud Run seem like suitable choices. Both offer:
1. Scalability to handle 10,000 requests/month.
2. Pay-per-use model, cost-effective for your anticipated traffic.
3. Ease of deployment compared to Compute Engine.
Remember, you can always start with a smaller configuration and scale up as your traffic increases. Explore the documentation for App Engine[1] and Cloud Run[2] to learn more about deployment, configuration options, and pricing.
[1] https://cloud.google.com/appengine/docs
[2] https://cloud.google.com/run/docs
Regards,
Jai Ade