Void_ 2 days ago

In the past month or so, I added 2 models to my app Whisper Memos (https://whispermemos.com):

- Cohere Transcribe (self hosted)

- Grok Speech To Text (they provide an API, only $0.10/hr!)

They are both excellent. I'm not sure about this one. Would you like to see it in a consumer speech to text app?

olejorgenb 2 days ago | parent | next [-]

I've had good experiences with the Mistral Voxtral models (I've used the API, but some of the model variants are open weight).

Barbing 2 days ago | parent | prev | next [-]

Does Cohere work with longer transcripts? Do you have to do some magic to merge recordings over 35 seconds long?

2ndorderthought 2 days ago | parent | prev | next [-]

Have you tried qwen?

SecretDreams 2 days ago | parent | prev [-]

Any non-Musk alternatives that are comparable in quality and cost?

jayphen 2 days ago | parent | next [-]

Voxtral competes on price ($0.003/min) and quality. Speechmatics has best-in-class accuracy but is a bit more expensive ($0.004/min).
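
Since the Grok price upthread is quoted per hour and these are per minute, a quick conversion makes them comparable. This is only a rough sketch: the figures are the ones quoted in this thread, not checked against current vendor pricing, and the helper name `per_hour` is made up for illustration.

```python
# Prices as quoted in this thread (assumption: not verified against vendor pricing pages).
PER_MINUTE_USD = {"Voxtral": 0.003, "Speechmatics": 0.004}
GROK_PER_HOUR_USD = 0.10


def per_hour(per_minute_usd: float) -> float:
    """Convert a per-minute price to a per-hour price, rounded to cents."""
    return round(per_minute_usd * 60, 2)


# Put every price on the same per-hour basis.
hourly = {name: per_hour(price) for name, price in PER_MINUTE_USD.items()}
hourly["Grok"] = GROK_PER_HOUR_USD

# Print cheapest first.
for name, price in sorted(hourly.items(), key=lambda item: item[1]):
    print(f"{name}: ${price:.2f}/hr")
```

On these quoted numbers, Voxtral works out to $0.18/hr and Speechmatics to $0.24/hr, against $0.10/hr for Grok.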

Void_ 2 days ago | parent | prev [-]

Our default is still OpenAI Whisper. Grok is just a choice for users who might prefer it.