new | show | ask | jobs Github

voxadam a day ago

Am I correct in understanding that Whisper is a speech recognition AI model originally created by OpenAI?

https://en.wikipedia.org/wiki/Whisper_(speech_recognition_sy...

▲

Maxious a day ago | parent | next [-]

yep, there's a c++ implementation to run it https://github.com/ggml-org/whisper.cpp

▲

oezi a day ago | parent [-]

Isn't WhisperX the canonical choice for running Whisper?

	▲	0points a day ago \| parent \| next [-]
		While whisper and whisperx is python implementations, the whisper.cpp wins the benchmarks.
	▲	sampullman a day ago \| parent \| prev [-]
		Maybe for running locally? whisper.cpp is nice because you can embed it pretty easily in apps for various targets like iOS, OSX, Android, wasm, etc.

▲

johnisgood a day ago | parent | prev | next [-]

Yes.

From the documentation:

> It runs automatic speech recognition using the OpenAI's Whisper model.

▲

voxadam a day ago | parent [-]

Thanks, I was being tripped up by DDOS protection on code.ffmpeg.org for a minute and couldn't read the patch. The combo of Firefox and the fact that Quantum/Lumen/CenturyLink seems to get off by rotating my dynamic IP for no reason occasionally triggers various DDOS protections schemes.

	▲	johnisgood 19 hours ago \| parent [-]
		No problem. :) Yeah, it took me 8 seconds to get through. It seems your issue was worse.

▲

acidburnNSA a day ago | parent | prev | next [-]

Yes, according to the comments in the patch, you are correct.

▲

a day ago | parent | prev | next [-]

[deleted]

▲

cess11 a day ago | parent | prev | next [-]

Kind of, it's a family of audio transcription models.

https://huggingface.co/search/full-text?q=whisper

▲

AlienRobot a day ago | parent | prev | next [-]

I think so, if I remember correctly PotPlayer also supports it for automatic subtitling.

▲

kwar13 a day ago | parent | prev [-]

yes.