Remix clone Hacker News

new | show | ask | jobs Github

	▲	octember 5 hours ago
		Cool idea, but keyframes are not videos. Motion, object permanence, are not things Claude can infer from a set of images. Nice demo though!
	▲	sawjet 3 hours ago \| parent \| next [-]
		I have been going through this with claude and qwenvl3:8b this week. Both are pretty decent at inferring context and analyzing contact sheets. Finding high visual interest moments with a mixture of coarse and fine keyframes.
	▲	fzysingularity 3 hours ago \| parent \| prev [-]
		Exactly! We experimented with a whole bunch of video encoding techniques for LLMs here: https://vlm-run.github.io/mm/encoders/#video