BoxOfRain 13 hours ago
I quite like IndexTTS2 personally. It does voice cloning and also lets you modulate emotion manually through emotion vectors, which I've found to be quite a powerful tool. It's not necessarily something everyone needs, but it's really cool technology in my opinion. It's been particularly useful for a model orchestration project I've been working on. I have an external emotion classification model driving both the LLM's persona and the TTS output, so the two stay relatively consistent. The affect system also influences which memories are retrieved: it's more likely to retrieve 'memories' created in the current affect state. IndexTTS2 was pretty much the only TTS that gave the level of control I felt was necessary.
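A minimal sketch of what affect-biased memory retrieval like this could look like. Everything here is an assumption for illustration, not the project's actual code: the `Memory` record, the `retrieve` function, the `affect_weight` parameter, and the idea of blending semantic similarity with emotion-vector similarity are all hypothetical, and nothing below touches the IndexTTS2 API.

```python
import math
from dataclasses import dataclass


@dataclass
class Memory:
    text: str
    embedding: list[float]  # semantic embedding of the memory (hypothetical)
    affect: list[float]     # emotion vector active when the memory was stored


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(memories, query_embedding, current_affect, affect_weight=0.3, k=3):
    """Rank memories by a blend of semantic and affect similarity.

    affect_weight is an assumed tuning knob: 0.0 ignores affect entirely,
    1.0 ranks purely by how closely the stored affect matches the current one.
    """
    def score(m: Memory) -> float:
        semantic = cosine(m.embedding, query_embedding)
        affective = cosine(m.affect, current_affect)
        return (1.0 - affect_weight) * semantic + affect_weight * affective

    return sorted(memories, key=score, reverse=True)[:k]


# Two memories that are equally relevant semantically but were stored
# under different affect states (toy 2-dimensional vectors).
memories = [
    Memory("calm walk", [1.0, 0.0], [0.0, 1.0]),
    Memory("angry argument", [1.0, 0.0], [1.0, 0.0]),
]
top = retrieve(memories, query_embedding=[1.0, 0.0], current_affect=[1.0, 0.0], k=1)
```

With equal semantic relevance, the memory whose stored affect matches the current state ranks first. The same emotion vector from the classifier could then be passed on to the persona prompt and the TTS stage so that all three stay aligned.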
realityfactchex 6 hours ago
Wow, the IndexTTS2 demo is very good. Definitely going to check that out. Thanks.