wkcheng 6 days ago
Does this support using the Parakeet model locally? I'm a MacWhisper user and I find that Parakeet is way better and faster than Whisper for on-device transcription. I've been using push-to-transcribe with MacWhisper through Parakeet for a while now and it's quite magical.
braden-w 6 days ago
Not yet, but I want it too! Parakeet looks incredible (saw that leaderboard result). My current roadmap is: finish stabilizing whisper.cpp integration, then add Parakeet support. If anyone has bandwidth to PR the connector, I’d be thrilled to merge it.
daemonologist 6 days ago
Parakeet is amazing - 3000x real-time on an A100 and 5x real-time even on a laptop CPU, while being more accurate than whisper-large-v3 (https://huggingface.co/spaces/hf-audio/open_asr_leaderboard). NeMo is a little awkward though; I'm amazed it runs locally on Mac (for MacWhisper).
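For a sense of scale, here is a rough sketch of what those throughput figures imply, assuming "Nx real-time" means N seconds of audio transcribed per second of compute. The function name `processing_seconds` is purely illustrative and not part of any library.

```python
def processing_seconds(audio_seconds: float, speedup: float) -> float:
    """Time needed to transcribe audio at a given multiple of real time.

    Assumes "Nx real-time" means N seconds of audio per second of compute.
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return audio_seconds / speedup


one_hour = 3600.0  # one hour of audio, in seconds

# Figures quoted in the comment above: 3000x on an A100, 5x on a laptop CPU.
a100 = processing_seconds(one_hour, 3000)
laptop = processing_seconds(one_hour, 5)

print(f"A100: {a100:.1f} s, laptop CPU: {laptop / 60:.0f} min")
```

Under that reading, an hour of audio takes on the order of a second on the A100 and about twelve minutes on the laptop CPU.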
warangal 5 days ago
A bit tangential, but about Parakeet and other NVIDIA NeMo models: I never found the actual architecture implementations as PyTorch/TF code. It seems all such models are instantiated from a binary blob, which makes it difficult to experiment! Maybe I missed something. Does anyone here have more experience with .nemo models who could shed some light on this?
polo 6 days ago
+1 for MacWhisper. Very full-featured, nice that it's a one-time purchase, and the developer is constantly improving it.
mark212 6 days ago
Seems like "not yet" is the answer from other comments.