Remix.run Logo
klntsky 5 hours ago

why not skip the text conversion? is it usable at all?

sohamrj 5 hours ago | parent [-]

gemini embedding 2 converts straight video to vectors. in this case, dashcam clips don't have audio to transcribe and even if they did, it would be useless in the search

password4321 4 hours ago | parent [-]

What are the SoA audio models right now?