Remix.run Logo
isoprophlex 7 hours ago

I just yeeted a bunch of extremely noisy fragments into elevenlabs, and it came out pretty good on their cheap $5 plan. If you're after this for your own amusement, let me know if you want a screencap, or a dump of the source files.

Obv no clean room reconstruction but good enough for personal use...

sigmoid10 6 hours ago | parent [-]

I have lots of super high quality, clean audio recordings from her ripped from an old video game that she did voice work for. I've tried various TTS models over the years with it. Getting the pitch and tune is easy, but getting the impersonal detached robot-y feeling is kinda tricky. But I haven't tried in the past 6 months, so maybe it's time to give it another shot.

isoprophlex 5 hours ago | parent [-]

https://github.com/jarombouts/star-trek-voice-clone

audio files sourced from https://www.trekcore.com/audio/

the inflection and impersonal feel is definitely hard to get right. there are parameters in the elevenlabs API docs to make the voice more stable (= monotonous; see speak.sh in that repo) but still the voice cloner on my $5 plan doesn't really get it right.

nevertheless... i'm still having a lot of fun with this.

edit: if I am forced to rot my brain with the 10x productivity boosting slop gun, at least I'll do it grinning

     > pod cleaned up. waiting on the behemoth to finish grinding through Italy.
     < if only postgres had progress indicators

       ... then they coulda called it progresql
     > lmaooo
     > Bash(~/speak.sh "Joke detected. Humor subroutine engaged. Ha. Ha. Ha.")