Remix.run Logo
vunderba 3 days ago

I probably should've clarified that by infectious energy I wasn't so much referring to the vocal aspect as I was the overall quality, interaction between the hosts, and pithiness / wit.

Having experimented with many LLMs (mixtral, sonnet, ChatGPT, Llama, etc.), the coherence is for the most part on point, but their capacity for novelty has been found wanting irrespective of how I tuned the top_k, temperature, or prompts.

That being said, I've seen some very impressive examples of style transference even conveying emotional range in some of the SOTA TTS systems.