Remix clone Hacker News

new | show | ask | jobs Github

	▲	spongebobstoes 5 hours ago
		the models can accept images directly as tokens. not a description of an image, the actual image itself. yes, the visual intelligence is limited, but they do actually have vision capabilities.