I wanted to build my own speech-to-text transcription program [1] for Discord, similar to how Zoom or Google Hangouts work. I built it so that I can record my group's D&D sessions and build applications and tools for VTTs (Virtual TableTop gaming).

It can process a set of 3-hour audio files in ~20 mins.

I recorded a demo video of how it works here: https://www.youtube.com/watch?v=v0KZGyJARts&t=300s

[1] https://github.com/naveedn/audio-transcriber

I alluded to building this tool on a previous HN thread: https://news.ycombinator.com/item?id=45338694

rudderdev 12 hours ago

I have found a hack. If you wait long enough, someone will build what you wanted to build :)

Thanks for building this. I am trying to set it up but am facing this issue:

> `torch` (v2.3.1) only has wheels for the following platforms: `manylinux1_x86_64`, `manylinux2014_aarch64`, `macosx_11_0_arm64`, `win_amd64`

	nvdnadj92 11 hours ago
		Ah lovely! I’d be happy to assist. Create an issue on GitHub and we can go from there!