Remix clone Hacker News

new | show | ask | jobs Github

	▲	mettamage 3 hours ago
		Do you have a good resource on how to finetune a model like Qwen? I am curious to try it out.
	▲	trilogic 3 hours ago \| parent \| next [-]
		Here is a dataset you can choose from: https://huggingface.co/datasets/Avtrkrb/combined-reasoning-o... Get a 10000 samples from it according to your needs and go for it. The key (in my opinion) is not cutting the Sequence Length among other things. Whatever traditional finetuning repo will do, if your hardware supports it Unsloth is faster.
	▲	verdverm 3 hours ago \| parent \| prev [-]
		Unsloth has good resources