Remix clone Hacker News

new | show | ask | jobs Github

	▲	littlestymaar 9 hours ago
		If what you refer to by “on demand training ” is fine tuning, it's going to be much more efficient on a small model than a big one.
	▲	red75prime 8 hours ago \| parent [-]
		LoRA can work with big models. But I mean sample-efficient RL.