Remix.run Logo
voxadam a day ago

Am I correct in understanding that Whisper is a speech recognition AI model originally created by OpenAI?

https://en.wikipedia.org/wiki/Whisper_(speech_recognition_sy...

Maxious a day ago | parent | next [-]

yep, there's a c++ implementation to run it https://github.com/ggml-org/whisper.cpp

oezi a day ago | parent [-]

Isn't WhisperX the canonical choice for running Whisper?

0points a day ago | parent | next [-]

While whisper and whisperx is python implementations, the whisper.cpp wins the benchmarks.

sampullman a day ago | parent | prev [-]

Maybe for running locally? whisper.cpp is nice because you can embed it pretty easily in apps for various targets like iOS, OSX, Android, wasm, etc.

johnisgood a day ago | parent | prev | next [-]

Yes.

From the documentation:

> It runs automatic speech recognition using the OpenAI's Whisper model.

voxadam a day ago | parent [-]

Thanks, I was being tripped up by DDOS protection on code.ffmpeg.org for a minute and couldn't read the patch. The combo of Firefox and the fact that Quantum/Lumen/CenturyLink seems to get off by rotating my dynamic IP for no reason occasionally triggers various DDOS protections schemes.

johnisgood 19 hours ago | parent [-]

No problem. :) Yeah, it took me 8 seconds to get through. It seems your issue was worse.

acidburnNSA a day ago | parent | prev | next [-]

Yes, according to the comments in the patch, you are correct.

a day ago | parent | prev | next [-]
[deleted]
cess11 a day ago | parent | prev | next [-]

Kind of, it's a family of audio transcription models.

https://huggingface.co/search/full-text?q=whisper

AlienRobot a day ago | parent | prev | next [-]

I think so, if I remember correctly PotPlayer also supports it for automatic subtitling.

kwar13 a day ago | parent | prev [-]

yes.