Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

How to expose API for vertex ai search for retail service.

Hi, community members, Happy New Year!

Now I am working on vertex ai search for retail function and need to expose API

This API should take care of lots of requests from lots of users.

I thought it can be done with google cloud function but not sure this is the best approach.

I would like to get an advice on this: what is the best way to expose this API in optimized way.

Thank you for all.

 

 

0 1 373
1 REPLY 1

Hi @pavlodidushko

Welcome to Google Cloud Community!

Using Cloud Functions to expose your Vertex AI Search for Retail API is a good idea, but you'll need to consider handling high traffic from multiple users effectively. 

For your use case that involves high traffic you might also consider using Cloud Run. Cloud Run is the best choice for high-traffic APIs because it's fast, scales predictably, and supports containerized applications. 

Also, here are tools you might consider to optimize performance and scalability : 

Use an API Gateway - It can help manage traffic, enforce rate limits and provide security features. 

Consider Cloud Balancing and Cloud CDN - Load balancers help in distribution of traffic evenly across servers and CDNs reduce redundant requests and improve response time. 

Scale Horizontally - Use horizontal scaling to handle increased traffic by adding more servers or instances. This can be done manually or using auto-scaling features provided by cloud providers.

 Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.