| ▲ | yNeolh 6 hours ago | |||||||||||||
That happens in most speech to text systems, even Superwhisper, Monologue and Wispr Flow. I read somewhere it comes from training on YouTube audio and happens when there is silence. I guess it depends on the model but most of them are based on Whisper which has this problem | ||||||||||||||
| ▲ | zugi 6 hours ago | parent | next [-] | |||||||||||||
> I read somewhere it comes from training on YouTube audio Does it also insert "please like & subscribe?" | ||||||||||||||
| ||||||||||||||
| ▲ | mr-wendel 5 hours ago | parent | prev [-] | |||||||||||||
Ha, I also have this happen all the time in response to mouse clicks. When playing with Apple Foundation Models + Whisper I noticed that it happens so often that I had to explicitly filter this out before acting on transcriptions. | ||||||||||||||