About danon

danon · 12-23-2024

Is it possible to get a summary of an audio conversation with Gemini's multimodal live API in the same session? Theoretically you can configure a session to support both audio and text: generation_config = { "responseModalities": ["TEXT", "AUDIO"],... So...
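
For anyone wanting to try the configuration quoted above, here is a minimal sketch of how such a setup payload could be assembled before sending it over the websocket. Only the `generation_config` / `responseModalities` key comes from the post; the `setup` envelope, the `model` key, the placeholder model name, and the helper `build_setup_message` are assumptions for illustration. Nothing here confirms that the API honors both modalities in one session, and the later replies in this thread report that it did not work.

```python
import json


def build_setup_message(model, modalities):
    """Return the JSON text of a hypothetical session-setup message.

    The "setup" envelope and "model" key are assumptions for illustration;
    only "generation_config" / "responseModalities" is quoted from the post.
    """
    allowed = {"TEXT", "AUDIO"}
    unknown = [m for m in modalities if m not in allowed]
    if unknown:
        raise ValueError(f"unsupported modalities: {unknown}")

    generation_config = {"responseModalities": list(modalities)}
    return json.dumps(
        {"setup": {"model": model, "generation_config": generation_config}}
    )


# "models/example-model" is a placeholder, not a real model identifier.
message = build_setup_message("models/example-model", ["TEXT", "AUDIO"])
print(message)
```

The resulting string would be sent as the first message on the socket. Whether the server accepts it is exactly the open question in this thread.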

danon · 12-30-2024

Have you tried it? I mention it because I have tried this and other ideas, and basically nothing works. That is why I said "I give up" in the first place: it simply doesn't work. The caveat, not documented anywhere, is that although Gemini 2.0...

danon · 12-30-2024

Can you please elaborate more on that workaround? Thanks

danon · 12-27-2024

I give up, at least for now. I've tried every possible combination. I don't think the text modality has access to the audio context. Although you can use both text and audio modalities in one session, neither has access to the other's context, so yo...

danon · 12-26-2024

Tried to follow your steps but cannot make it work. For context, my codebase acts as middleware that connects an external app, which streams voice conversations via websockets, to Gemini, so Gemini multimodal live API can a...

Recent Activity

Summary after audio conversation in Gemini's multimodal live API

Re: Summary after audio conversation in Gemini's multimodal live API
