Remix clone Hacker News

new | show | ask | jobs Github

	▲	merlindru 5 hours ago
		surely training also gets cheaper so justifying it becomes easier? i think it'll be more like we get 1-10T models and then distill those down into smaller models, though It seems like the best small models today are all distilled from bigger models Moreover, I hypothesize Claude Opus 4.7 and now 4.8 are a distillation of Claude Mythos