I have a question that is crucial for inference time and efficiency at my company.
We're using Vector Search with metadata filtering to retrieve the right samples. For efficiency and inference speed, I would like to know how the retriever handles filtering under the hood.
(1) Does it first run the similarity search and then filter the results by the metadata we chose? Or (2) does it first filter by metadata and then run the similarity search only over the matching chunks?
What I mean is: in case (1), the search would not run only over the chunks that match the metadata I want (let's say, a specific user); it would run over all chunks and filter afterwards.
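To make the two cases concrete, here is a toy sketch of what I mean. It is plain NumPy with brute-force cosine similarity, not the actual Vector Search implementation, and every name in it (`post_filter_search`, `pre_filter_search`, the `users` metadata list) is made up for illustration.

```python
import numpy as np

def cosine_scores(query, vectors):
    # Cosine similarity between one query vector and every row of `vectors`.
    q = query / np.linalg.norm(query)
    v = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    return v @ q

def post_filter_search(query, vectors, users, user, k):
    # Case (1): rank ALL chunks, keep the top k, then drop the ones
    # whose metadata does not match. Can return fewer than k results.
    scores = cosine_scores(query, vectors)
    top = np.argsort(-scores)[:k]
    return [int(i) for i in top if users[i] == user]

def pre_filter_search(query, vectors, users, user, k):
    # Case (2): keep only the chunks whose metadata matches, then
    # rank just those candidates by similarity.
    candidates = np.flatnonzero(np.array(users) == user)
    scores = cosine_scores(query, vectors[candidates])
    order = np.argsort(-scores)[:k]
    return [int(candidates[i]) for i in order]

# Hypothetical toy data: five 2-D chunk embeddings, each owned by a user.
vectors = np.array([[1.0, 0.0], [0.9, 0.1], [0.8, 0.2], [0.0, 1.0], [0.1, 0.9]])
users = ["bob", "bob", "alice", "alice", "bob"]
query = np.array([1.0, 0.0])

print(post_filter_search(query, vectors, users, "alice", k=2))  # []
print(pre_filter_search(query, vectors, users, "alice", k=2))   # [2, 3]
```

In this toy data the two closest chunks overall both belong to "bob", so case (1) returns nothing for "alice", while case (2) returns her two chunks. That difference, plus the cost of scoring every chunk in case (1), is why I want to know which one is actually used.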
Sorry if this question does not belong here, but I have searched all over the internet and cannot find an answer.