I'm trying to understand the pricing for image inputs to Gemini Pro, which seems to be contradictory.
On this page, it's written that the token cost per image is calculated by splitting the image into tiles of 258 tokens each. So a larger image is likely to incur higher token costs than a smaller one. My rough calculations say a maximum sized image would require 16 tiles x 258 tokens/tile = 4,128 tokens.
Here's how tokens are calculated for images:
But here, we see a fixed cost of $0.001315 per image (current Gemini Pro pricing). No mention of tiles or maximum dimensions for that price.
This seems to be contradictory. Anybody know which is the correct info?
Solved! Go to Solution.
Hi @matvei,
Each model has a maximum number of tokens that it can handle in a prompt and response. Knowing the token count of your prompt lets you know whether you've exceeded this limit or not. The token calculations outlined in the first link is useful when calculating or estimating the size of your total prompt, but it is not the cost of the prompt.
Your second link is the pricing of images in the prompt. As an example your billing report would have a line item that would include:
Gemini 1.5 Pro Image Input - Predictions (the SKU), XX images input, total $$ cost for images.
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |