Remix.run Logo
albertwang 4 hours ago

great news, this looks great! is it just me, or do most of the english audio samples sound like anime voices?

bityard 2 hours ago | parent | next [-]

Well, if you look at the prompts, they are basically told to sound like that.

And if you ask me, I think these models were trained on tween fiction podcasts. (My kids listen to a lot of these and dramatic over-acting seems to be the industry standard.)

Also, their middle-aged adult with an "American English" accent sounds like any American I've ever met. More like a bad Sean Connery impersonator.

rapind 4 hours ago | parent | prev | next [-]

> do most of the english audio samples sound like anime voices?

100% I was thinking the same thing.

reactordev 3 hours ago | parent | prev | next [-]

The real value I see is being able to clone a voice and change timbre and characteristics of the voice to be able to quickly generate voice overs, narrations, voice acting, etc. It's superb!

devttyeu 4 hours ago | parent | prev | next [-]

Also like some popular youtubers and popular speakers.

pixl97 3 hours ago | parent [-]

Hmm, wonder where they got their training data from?

thehamkercat 3 hours ago | parent | prev | next [-]

even the Japanese audio samples sound like anime

htrp 3 hours ago | parent | prev [-]

subbed audio training data (much better than cc data) is better