I am trying to embed a collection of documents to the vector search index on Vertex AI. The issue is that whenever I embed many documents at once, a subset of them are not embedded. I can not figure out why there are not embedded, nor which documents have been skipped. This issue occurs consistently. When I embed ~35 documents, around 7 get dropped. When I embed ~150 documents, around 15 get dropped.
I have created a GCS staging bucket, a Vertex AI index, endpoint, and have deployed that endpoint to that index. I created the index like so:
```
I create a vector store like so:
```
Hello,
Thank you for contacting Google Cloud Community!
I belive you are encountering inconsistent results when embedding documents into a Vertex AI vector search index, with a subset of documents not being embedded despite being present in the GCS bucket and the generated documents.json file.
Regards,
Jai Ade
Hello,
Thank you for your engagement regarding this issue. We haven’t heard back from you regarding this issue for sometime now. Hence, I'm going to close this issue which will no longer be monitored. However, if you have any new issues, Please don’t hesitate to create a new issue. We will be happy to assist you on the same.
Regards,
Jai Ade
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |