| ▲ | albertwang 4 hours ago | |||||||
great news, this looks great! is it just me, or do most of the english audio samples sound like anime voices? | ||||||||
| ▲ | bityard 2 hours ago | parent | next [-] | |||||||
Well, if you look at the prompts, they are basically told to sound like that. And if you ask me, I think these models were trained on tween fiction podcasts. (My kids listen to a lot of these and dramatic over-acting seems to be the industry standard.) Also, their middle-aged adult with an "American English" accent doesn't sound like any American I've ever met. More like a bad Sean Connery impersonator. | ||||||||
| ▲ | rapind 4 hours ago | parent | prev | next [-] | |||||||
> do most of the english audio samples sound like anime voices? 100% I was thinking the same thing. | ||||||||
| ▲ | reactordev 3 hours ago | parent | prev | next [-] | |||||||
The real value I see is being able to clone a voice and change timbre and characteristics of the voice to be able to quickly generate voice overs, narrations, voice acting, etc. It's superb! | ||||||||
| ▲ | devttyeu 4 hours ago | parent | prev | next [-] | |||||||
Also like some popular youtubers and popular speakers. | ||||||||
| ▲ | thehamkercat 3 hours ago | parent | prev | next [-] | |||||||
even the Japanese audio samples sound like anime | ||||||||
| ▲ | htrp 3 hours ago | parent | prev [-] | |||||||
Subbed audio makes better training data (much better than CC data). | ||||||||