Remix clone Hacker News

new | show | ask | jobs Github

	▲	pianopatrick 3 hours ago
		Do you think a similar approach would work with smaller models, like 1.5B models?
	▲	zambelli 3 hours ago \| parent [-]
		I would expect so! I'm currently running Gemma 4 E4B evals and it's behaving the same. Better with guardrails. There might be a floor where any error nudge confuses the model more than helps, but I haven't found it across many 8B families and now Gemma 4 E4B.