Hello Google Cloud Community,
I am fine-tuning Gemini 1.0 Pro on a small dataset of approximately 5-10 MB and need your help determining the hosting, deployment, and training costs. My goal is to get an accurate estimate of the infrastructure requirements and associated costs. Although I have reviewed the Gemini token pricing and tried using the Google Cloud Pricing Calculator, I couldn’t find detailed information that aligns with my specific use case.
I would appreciate any insights or recommendations that can guide me toward a more accurate cost estimation. Thank you for your time and expertise!
Solved! Go to Solution.
Hi @Adityapurohit,
Welcome to Google Cloud Community!
In addition to @abhishekbhagwat, to get a better estimate of the costs for fine-tuning Gemini 1.0 Pro on your dataset, let's break down your requirements into three main areas: machine type, hosting costs, and training costs.
1. Machine Type for training: For fine-tuning models like Gemini 1.0 Pro, using a machine with GPU or TPU capabilities is ideal. Here are some suggestions:
Recommendation: If your budget allows, consider an n1-standard-8 instance with a single A100 GPU or a TPU v2 for efficient training.
2. Hosting Costs: For hosting the trained model for inference, costs depend on the expected traffic and usage patterns. You might consider the following:
3. Training Costs: Training costs depend on the duration and type of resources used.
To get an estimate, try using the Google Cloud Pricing Calculator with your specific configurations. It’ll provide figures based on your usage.
I hope the above information is helpful.
Hi!
We offer a managed tuning service for Gemini - https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini-use-supervised-tuning
Tuning is charged for the total number of tokens in your dataset rather than the machine costs.
Predictions on the tuned endpoints are the same as the untuned gemini endpoints, we do not charge anything extra for inference on the tuned endpoints - https://cloud.google.com/vertex-ai/generative-ai/pricing#gemini-models
Hope this helps!
Hi @Adityapurohit,
Welcome to Google Cloud Community!
In addition to @abhishekbhagwat, to get a better estimate of the costs for fine-tuning Gemini 1.0 Pro on your dataset, let's break down your requirements into three main areas: machine type, hosting costs, and training costs.
1. Machine Type for training: For fine-tuning models like Gemini 1.0 Pro, using a machine with GPU or TPU capabilities is ideal. Here are some suggestions:
Recommendation: If your budget allows, consider an n1-standard-8 instance with a single A100 GPU or a TPU v2 for efficient training.
2. Hosting Costs: For hosting the trained model for inference, costs depend on the expected traffic and usage patterns. You might consider the following:
3. Training Costs: Training costs depend on the duration and type of resources used.
To get an estimate, try using the Google Cloud Pricing Calculator with your specific configurations. It’ll provide figures based on your usage.
I hope the above information is helpful.
Thankyou for this information 😊