▲ | eevmanu 3 days ago | |
If I understood correctly, VAD has superior results than using ffmpeg silencedetect + silentremove, right? I think latest version of ffmpeg could use whisper with VAD[1], but I still need to explore how with a simple PoC script I'd love to know more about the post-processing prompt, my guess is that looks like an improved version of `semantic correction` prompt[2], but I may be wrong ¯\_(ツ)_/¯ . [1] https://ffmpeg.org/ffmpeg-filters.html#toc-whisper-1 [2] https://gist.github.com/eevmanu/0de2d449144e9cd40a563170b459... |