| ▲ | dharma1 3 hours ago | ||||||||||||||||
Yes the voice part of OpenAI realtime/voice mode is great but it’s pretty dumb compared to newer models and often gets stuck repeating itself. Google’s Gemini flash live 3.1 is better, especially used via the API - it can do tool calling (including to other, even smarter LLMs if you set it up yourself), you can set the reasoning level (even high is still close enough to realtime) and it can ground answers in google search. I love bidirectional voice and right now it’s probably the best option. You can try it in AI studio | |||||||||||||||||
| ▲ | Lucasoato 3 hours ago | parent [-] | ||||||||||||||||
Thanks, I’ll try it, even if my experience wasn’t that great with Google models lately (503s) | |||||||||||||||||
| |||||||||||||||||