| ▲ | shambu2k 5 hours ago | ||||||||||||||||
Damn, you beat me to it. I was building something similar but got too caught up optimizing the context extraction. I actually ended up building a full spec for it—basically a PoC of "grep for videos." My end goal was to let an agent make semantic changes (e.g., "remove the parts where the guy in the blue dress is seen") by simply grepping the context spec for the relevant timestamps and using ffmpeg to cut them out. How are you extracting context from videos? | |||||||||||||||||
| ▲ | adishj 5 hours ago | parent [-] | ||||||||||||||||
how would this be different from vector embeddings / semantic search? | |||||||||||||||||
| |||||||||||||||||