Morizero 15 hours ago
You don't happen to know a Whisper solution that combines diarization with live audio transcription, do you?
peterleiser 10 hours ago
Check out https://github.com/jhj0517/Whisper-WebUI. I ran it last night using Docker and it worked extremely well. You need a HuggingFace read-only API token for the diarization. I found that the web UI ignored the token, but it worked fine when I added it to Docker Compose as an environment variable.
jduckles 15 hours ago
WhisperX's diarization is great imo:
Works a treat for Zoom interviews. Diarization is sometimes a bit off, but generally it's correct.
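
For anyone wondering how diarization output typically gets combined with a transcript: a common approach is to give each transcribed segment the speaker whose turn overlaps it most in time. Below is a minimal stdlib-only sketch of that idea. It is not WhisperX's actual code; the `Span` type, the `assign_speakers` helper, the `SPEAKER_00`-style labels, and the sample timings are all made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Span:
    start: float  # seconds
    end: float    # seconds
    label: str    # transcript text, or a speaker id for diarization turns


def overlap(a, b):
    """Length in seconds of the intersection of two spans (0 if disjoint)."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def assign_speakers(transcript, turns):
    """Pair each transcript span with the speaker turn it overlaps most."""
    result = []
    for seg in transcript:
        # Pick the turn with the largest temporal overlap, if any turns exist.
        best = max(turns, key=lambda t: overlap(seg, t), default=None)
        if best is not None and overlap(seg, best) > 0:
            speaker = best.label
        else:
            speaker = "UNKNOWN"  # no diarization turn covers this segment
        result.append((speaker, seg.label))
    return result


# Hypothetical sample data: three transcript segments, two speaker turns.
transcript = [
    Span(0.0, 2.5, "Thanks for joining."),
    Span(2.6, 5.0, "Happy to be here."),
    Span(9.0, 10.0, "(crosstalk)"),
]
turns = [Span(0.0, 2.4, "SPEAKER_00"), Span(2.4, 6.0, "SPEAKER_01")]

labeled = assign_speakers(transcript, turns)
for speaker, text in labeled:
    print(f"{speaker}: {text}")
```

This also hints at why the labels can be "a bit off": a segment that straddles a speaker change gets a single label, whichever turn covers more of it.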
kmfrk 15 hours ago
Proper diarization remains a white whale for me, unfortunately. Last I looked into it, the main options required API access to external services, which put me off. I think it was pyannote.audio[1].