Remix.run Logo
benatkin 3 hours ago

To save a click, it's just a fancy front end for Whisper plus a weaker CPU-only model. It has a demo video that seems impressive, but the speech is careful to sound casual while having no meaningful flaws that would cause it to mess up. If you want to make a speech to speech tool, which is what this post asks about, it would make more sense to go straight to Whisper.

joshribakoff 2 hours ago | parent | next [-]

I use it, sponsor it, and did a small pr. One of its goals is to be the most “forkable” starting point if i recall. But yes its just voice input. It’s meaningfully better than the mac dictation for me.

tuananh 2 hours ago | parent | prev [-]

you can use gpu too. i have to admit the app is very easy to use and super convenient. kudos to creator

benatkin an hour ago | parent [-]

Yes, and with GPU, it's Whisper, which has been mentioned elsewhere in this article's comments. I mean that handy.computer provides the other option as a fallback for those who can't or don't want to use the GPU.