ilaksh 4 hours ago
It's a given that dense models of comparable size are better. I also confirmed that in my use case with those two Qwen 3.5 models. The benchmarks show 3.6 is a bit better than 3.5. I should retry my task, but I don't have a lot of confidence. Still, it does sound like they worked on the right thing, which is getting closer to the 27B performance.