Good time of a Day !
I encounter error when I try to save current prompt with 750 k tokens from 2 kk limit or when I try to add image file with size more than 10 mb ( check attached screenshot ) :
I have asked this question to My Google Cloud Account Executive , She told that I need to search quota "Generate content requests" for model I use .
There is no specific quota for payload size , only " minutes per project per region " .
How to ask to increase payload limit so that I can use more media files when work and collaborate with AI LLM Gemini in Vertex AI ?
All the Graces ^_^ ! ))
Hi @ArhmagosBasaroS,
Welcome to Google Cloud Community!
The error you're encountering isn't specifically due to a "payload size" limit, but rather a restriction on the resources used by a single request to the Gemini model. Large prompts with many tokens and large images demand more processing power, which can exceed the available quotas. Your account executive correctly identified the "Generate content requests" quota as the key factor. There isn't a single "payload size" setting that can be adjusted. When requesting an increase avoid emphasizing a payload limit. Instead, focus on the consumption of resources.
Once you've confirmed that all the necessary steps for the increase have been completed, you can reach out to Google support and work with your account executive to submit the official quota increase request.
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.