aabhay 9 hours ago

It handles both video frames and audio-in-video. So if you wanted, you could search for something that was said in a video and it'll find you the exact segment of it!

We don't use transcription or any post-processing. We simply embed the file. Our embedding has an additional inner dimension to support long-duration content, so it's [N x D], where D is the embedding dimension and N is an internal dimension that varies with the content.
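
To make the [N x D] shape concrete, here's a rough sketch of how a segment lookup could work on top of it. This is not our actual implementation. It assumes each of the N rows maps to one segment of the file, that the query is a single D-dimensional vector, and that cosine similarity is the scoring function. The names `cosine` and `best_segment` and the toy vectors are made up for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_segment(file_embedding, query):
    """Return (row index, score) of the row most similar to the query.

    file_embedding: list of N rows, each a list of D floats (the [N x D] embedding).
    query: list of D floats.
    Assumption: row i corresponds to segment i of the file.
    """
    scores = [cosine(row, query) for row in file_embedding]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


# Toy example: N = 3 segments, D = 3 dimensions (hypothetical values).
video = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.6, 0.8],
]
query = [0.0, 0.0, 1.0]
index, score = best_segment(video, query)
```

Here the third row (index 2) wins because it is the only one with any weight along the query direction. Mapping that index back to a timestamp would depend on how N is derived from the content, which this sketch doesn't cover.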