Is there any limitation on generating embeddings? I tried to generate embeddings for 152 chunks, with each chunk containing only one array element. However, I am still encountering the '429 Quota Exceeded' error. I have attached the screenshot for your reference. In the documentation, the request quota is mentioned as 600 requests per minute. Can you help me fix this?
Hi @KannanG03,
I appreciate your inquiry and expanding awareness of the community.
I understand that the documentation mentioned about 600 requests per minute inline with embedding, it has also mentioned that resource management are not limited to the number of request listed in the quota table, it can also be affected by other factors such as the number of jobs it makes and the accumulated data size or output.
* Resource management requests include any request that is not a job, long-running operation, online prediction request, or Vertex AI Vizier request.
You can review your project's Quota page to determine related services that are exceeding the quota. You may need to requesting a higher quota limit to fix the issue.
I found a thread in the community that is somewhat related to your concern (service health history), this may be a re-occurrence that needs the Vertex AI Support team's attention.
Hope this helps.
I have similar problem:
>promise catch: [VertexAI.ClientError]: got status: 429 Too Many Requests, code: undefined
although surely I don't hit the Vertex Quota limit, see photo attached.
what could be the problem?
No reply for a problem posted back in 02-07-2024?
I'm getting the same error, but just with langchain v0.2
Works:
```
Any update on this? I'm having a similar issue with `text-unicorn`. Was working in an earlier version of langchain, but after updating to the most recent release, I'm getting the 429 Quota exceeded error 🤔
I have the same issue - keep getting 429 error with the Vertex AI API but am no where near my quota limits.
I'm also encountering this, not really sure why its happening.
We are also having this same problem. Specifically when using the Gemini chat function. This is making it prohibitive to building and testing applications based on AI agents. This is a critical problem. Has anyone progressed through this problem?
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |