Remix clone Hacker News

new | show | ask | jobs Github

	▲	larodi 5 hours ago
		I would prefer GroundingDINo which is a sort of SAM and Dino combo which does open vocabulary.
	▲	geuis 2 hours ago \| parent [-]
		Doesn't work for my use-case. GroundingDINO is a text to bounding box model. SAM2 supports coordinate based masks (user taps or clicks somewhere in an image), which is what my research app needs.