Or distilled models, or just slightly smaller models with the same architecture. Lots of options, all of them conveniently fitting under "optimizing inferencing".
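For anyone unfamiliar, distillation trains a smaller "student" model to mimic a larger "teacher". A minimal sketch of the standard soft-target loss (assuming PyTorch; the names and temperature value here are just illustrative):

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence between temperature-softened teacher and
    student distributions (the classic Hinton-style soft targets)."""
    t = temperature
    soft_teacher = F.softmax(teacher_logits / t, dim=-1)
    log_soft_student = F.log_softmax(student_logits / t, dim=-1)
    # Scale by t^2 so gradient magnitudes stay comparable
    # to the ordinary hard-label cross-entropy term.
    return F.kl_div(log_soft_student, soft_teacher,
                    reduction="batchmean") * (t * t)

# Typically mixed with the hard-label loss, e.g.:
# loss = alpha * distillation_loss(s, t) + (1 - alpha) * F.cross_entropy(s, labels)
```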