sleepyeldrazi 3 hours ago
From a quick and shallow read of the paper, it looks very feasible (with a little tinkering) to adapt it to Qwen3.6 27B. The process looks somewhat similar to training a LoRA, or in a way to distilling your own model: a mini model learns how to imitate it, and you glue the two together. I might bite the bullet and rent a GPU to do it for 3.6 27B, as this would solve a lot of my problems.
sleepyeldrazi 3 hours ago | parent
Scratch that, I don't have that kind of money, and 3.5's architecture is a little more divergent from 3's, so it will be a bit less trivial. It does look possible, just not on a student's paycheck.