ideashower 2 days ago

A side project that takes legal documents and uses TTS models to create a narrated readout of the whole document.

Part of the reason I'm building my own solution is that legal documents are often distributed in PDFs which can have all kinds of formatting issues when converted to plain text. There's also specific jargon and formatting that may or may not need to be included, or spoken, or even spoken differently, that I am finding no commercial TTS platform like ElevenLabs really accounts for well. It's all about the pre-processing and chunking.
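
To give a rough idea of what I mean by pre-processing and chunking, here is a minimal Python sketch. It is not my actual pipeline: the function names, the two spoken-form rules, and the 400-character limit are all made up for illustration, and the sentence splitter is deliberately naive (it will split after abbreviations like "U.S.C.").

```python
import re


def clean_pdf_text(raw: str) -> str:
    """Undo common PDF-to-text damage (illustrative rules only)."""
    # Drop lines that are only a page marker, e.g. "12" or "Page 12 of 40".
    page_marker = r"\s*(Page\s+)?\d+(\s+of\s+\d+)?\s*"
    lines = [
        ln for ln in raw.splitlines()
        if not re.fullmatch(page_marker, ln, flags=re.IGNORECASE)
    ]
    text = "\n".join(lines)
    # Rejoin words hyphenated across a line break: "defen-\ndant" -> "defendant".
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # Collapse hard wraps and repeated spaces into single spaces.
    return re.sub(r"\s+", " ", text).strip()


def expand_for_speech(text: str) -> str:
    """Rewrite a couple of legal notations into spoken forms (hypothetical rules)."""
    text = re.sub(r"§\s*", "section ", text)            # "§ 1983" -> "section 1983"
    text = re.sub(r"(?<!\w)v\.(?=\s)", "versus", text)  # "Smith v. Jones"
    return text


def chunk_text(text: str, max_chars: int = 400) -> list[str]:
    """Greedily pack sentences into chunks of at most max_chars.

    A single sentence longer than max_chars is kept whole as its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text)  # naive sentence boundary
    chunks, current = [], ""
    for sentence in sentences:
        if current and len(current) + 1 + len(sentence) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}" if current else sentence
    if current:
        chunks.append(current)
    return chunks


raw = "The defen-\ndant shall pay.\nPage 3 of 10\nSee Smith v. Jones, 42 U.S.C. § 1983."
for chunk in chunk_text(expand_for_speech(clean_pdf_text(raw))):
    print(chunk)
```

Each chunk would then go to whatever TTS model you're using, with the audio stitched back together in order. The real work is in growing the rule tables and deciding what should be skipped versus spoken.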

Also, the commercial models are expensive when you're routinely throwing dozens of pages of text at them.