▲ | kelvinjps 4 hours ago | |||||||||||||
Google should have the needed tech for good AI transcription, why the don't integrate them in their auto-captioning? and instead the offer those crappy auto subtitles | ||||||||||||||
▲ | briga 4 hours ago | parent | next [-] | |||||||||||||
Are they crappy though? Most of the time it gets things right, even if they aren't as accurate as a human. And sure, they probably have better techniques for this, but are they cost-effective to run at YouTube-scale? I think their current solution is good enough for most purposes, even if it isn't perfect | ||||||||||||||
| ||||||||||||||
▲ | summerlight an hour ago | parent | prev [-] | |||||||||||||
YT is using USM, which is supposed to be their SOTA ASR model. Gemini have much better linguistic knowledge, but it's likely prohibitively expensive to be used on all YT videos uploaded everyday. But this "correction" approach seems to be a nice cost-effective methodology to apply LLM indeed. |