Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Fine-Tuning with Images and Text?!

I want to use a multimodal model such as Gemini to analyze certain characteristics of images. In Vertex AI demo page, I can insert multiple images combined with text to generate a text response. Is there a way to fine-tune a model on text and images to generate a text output?

0 1 705
1 REPLY 1