Hello everyone,
I'm experiencing a significant performance issue with my RAG engine implementation using the Vertex AI TypeScript library and I'm hoping to get some insights from the community.
Here's a summary of the situation:
This large discrepancy in performance suggests that the issue might be with parameters.
Current parameters:
{
model: "gemini-2.5-flash",
Has anyone else encountered a similar issue? I'm trying to understand what could be causing such a delay.
Any help or suggestions on what to investigate would be greatly appreciated.
Thanks in advance!
Hi @Roksi,
Welcome to Google Cloud Community!
Here are some suggestions that may help resolve the issue:
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.