Remix.run Logo
ashenke 3 hours ago

I use their Voxtral Mini STT audio model to automatically transcribe my podcasts into markdown. Out of all the STT models I've tried, it's both the best performing and one of the cheapest! It's really accurate, feeding the episode notes and the podcast description ensures all names are properly spelled, and speaker diarization works really great. (I just do a Gemini flash pass at the end to identify the speakers, so it shows the host name instead of "Speaker 1")