Remix clone Hacker News

new | show | ask | jobs Github

	▲	skissane 5 hours ago
		An LLM agent could be given a tool for self-finetuning… it could construct a training dataset, use it to build a LORA/etc, and then use the LORA for inference… that’s getting closer to your ideal