mettamage 3 hours ago
Do you have a good resource on how to finetune a model like Qwen? I am curious to try it out.
trilogic 3 hours ago
Here is a dataset you can choose from: https://huggingface.co/datasets/Avtrkrb/combined-reasoning-o... Take 10,000 samples from it according to your needs and go for it. The key (in my opinion), among other things, is not truncating the sequence length. Any traditional finetuning repo will do; if your hardware supports it, Unsloth is faster.
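A rough sketch of the two steps above (subsample, then check that nothing gets cut), using only the standard library. Everything here is an assumption for illustration: `sample_examples`, `required_seq_len`, and `truncated_fraction` are hypothetical helpers, and `count_tokens` is a whitespace stand-in for whatever tokenizer your model really uses, so real token counts will be higher.

```python
import random


def count_tokens(text):
    # Assumption: whitespace split as a stand-in for the model's real tokenizer.
    return len(text.split())


def sample_examples(examples, n, seed=0):
    """Return up to n examples drawn uniformly without replacement."""
    if n >= len(examples):
        return list(examples)
    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    return rng.sample(examples, n)


def required_seq_len(examples, count=count_tokens):
    """Smallest max sequence length that truncates none of the examples."""
    return max((count(text) for text in examples), default=0)


def truncated_fraction(examples, max_seq_len, count=count_tokens):
    """Fraction of examples that would be cut at the given max sequence length."""
    if not examples:
        return 0.0
    cut = sum(1 for text in examples if count(text) > max_seq_len)
    return cut / len(examples)
```

For example, after `subset = sample_examples(all_texts, 10000)`, comparing `required_seq_len(subset)` with the sequence length your hardware can afford shows whether you would be truncating, and `truncated_fraction(subset, 2048)` shows how many samples a given limit would cut.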
verdverm 3 hours ago
Unsloth has good resources.