Remix.run Logo
xena 3 days ago

> But for real I don't see a reason why Alexa is not using a good LLM now.

Large language models are too slow to use as real-time voice assistants. ChatGPT voice only barely works because they have to use a much worse (but faster) model to do it.

coredog64 3 days ago | parent | next [-]

Amazon has a commercial Speech-to-Text model (Nova Sonic) that is passable. I used it to create a post-sales call assistant and was surprised that the underlying model was able to do a bunch of stuff I thought I was going to have to use Claude for.

vonneumannstan 3 days ago | parent | prev [-]

At least on paper OpenAI claims the Voice models are actually the ones you are picking i.e. GPT 4o, 5. In any case even a GPT 3.5 would be superior to current alexa...