
Can Vertex API Multimodal Embedding be blocked after too many requests?

Hi, I am currently using Multimodal Embedding for images and I need it to be fast. Because there is a quota limit, I created a process that loops and sends parallel requests to 10 Vertex regions. But when I try it again, a previously used location, for example "asia-south1", can no longer be requested and fails with this error: 400 Project 123xx is not allowed to use Publisher Model projects/project-name/locations/asia-south1/publishers/google/models/multimodalembedding@001. Has anyone had a similar problem?
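For readers who want to picture the setup, here is a rough sketch of that kind of multi-region fan-out. It is not my exact code: `embed_fn(region, item)` is a hypothetical caller-supplied function that wraps the real embedding call, and the region list is only an example. The sketch stops sending work to a region once it rejects a request and retries the item elsewhere.

```python
from concurrent.futures import ThreadPoolExecutor
import threading

# Example regions only (assumption); use the ones enabled for your project.
REGIONS = ["us-central1", "europe-west4", "asia-south1"]


def embed_across_regions(items, embed_fn, regions=REGIONS, max_workers=4):
    """Spread items over regions; skip a region once it rejects a call.

    embed_fn(region, item) is a hypothetical wrapper around the real
    embedding request. It returns an embedding or raises on failure.
    """
    blocked = {}  # region -> last error message seen for it
    lock = threading.Lock()

    def run(start, item):
        # Try each region at most once, starting at a per-item offset
        # so that the load is spread across regions.
        for offset in range(len(regions)):
            region = regions[(start + offset) % len(regions)]
            with lock:
                if region in blocked:
                    continue
            try:
                return region, embed_fn(region, item)
            except Exception as exc:  # narrow to your client's error types
                with lock:
                    blocked[region] = str(exc)
        raise RuntimeError(f"all regions rejected the request: {blocked}")

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(run, range(len(items)), items))
    return results, blocked
```

Treating any exception as "region blocked" is deliberately blunt. In practice a transient error should probably be retried rather than used to rule out the whole region.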

3 REPLIES

Hello,

Thank you for contacting the Google Cloud Community.

I believe you are facing the issue because of the following:

  1. The region might have reached its quota for the model.
  2. There could be specific restrictions on your project using the model in that region.
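
One way to tell these two cases apart in a client is to look at the HTTP status code of the failed request. The sketch below assumes that quota exhaustion is reported as 429 (RESOURCE_EXHAUSTED), which is how Google APIs generally report it, and that the restriction case carries the "not allowed to use Publisher Model" text seen in the question. The function name and labels are hypothetical.

```python
def classify_vertex_error(status_code, message):
    """Return a rough label for a failed embedding request.

    Assumption: quota exhaustion arrives as HTTP 429, while a project or
    region restriction arrives as 400/403 with the message text below.
    """
    if status_code == 429:
        return "quota"
    if status_code in (400, 403) and "not allowed to use Publisher Model" in message:
        return "access"
    return "other"


# The error quoted in the question falls into the "access" bucket.
label = classify_vertex_error(
    400,
    "Project 123xx is not allowed to use Publisher Model "
    "projects/project-name/locations/asia-south1/publishers/google/"
    "models/multimodalembedding@001",
)
print(label)
```

Under these assumptions, a 400 with that message points to a restriction rather than an exhausted quota.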

Regards,
Jai Ade

Thank you for the answer. From my trial and error, some locations accept the embedding request and some do not, but a quota for that request does appear when I check "Quotas & System Limits". I hope there will be a better view for seeing the restrictions per region.

Hello,

Thank you for contacting the Google Cloud Community.

I have gone through your reported issue; however, it seems to be specific to your project and would need more detailed debugging and analysis. To ensure a faster resolution and dedicated support, I kindly request that you file a support ticket. Our support team will prioritize your request and provide you with the assistance you need.

For individual support issues, it is best to utilize the support ticketing system. We appreciate your cooperation!