| ▲ | CubsFan1060 2 days ago | |
Great post last night from Simon: https://simonwillison.net/2026/Apr/27/vibevoice/ | ||
| ▲ | 542458 2 days ago | |
Note that this only covers the Speech-to-Text/Speech-Recognition aspect (à la Whisper); there are also models for long-form Text-to-Speech and streaming Text-to-Speech. | ||
| ▲ | JumpCrisscross 2 days ago | |
“VibeVoice can only handle up to an hour of audio” Why? | ||