Remix.run Logo
m3kw9 3 hours ago

Grab frames, lower res, classify, combine meta data. Write to sql

iliashad 2 hours ago | parent [-]

Not really. Grab frames, lower res, classify, combine metadata, transcribe the audio, convert those data (text, visual and audio) to embedding, save them over a vector DB and SQL DB. Which helped me to do semantic search, RAG, search using a screenshot of the video to find the exact the moment in the video plus search using an audio file as well. And other features unlocked with vector DB

ingvay7 an hour ago | parent [-]

Really cool work and workflow. strongly prefer this kind of local, open pipeline that i control over a dependency on Adobe tools and lock ins.