otabdeveloper4 5 days ago
It's already possible. Post-training is vastly more important than model size. (There are big-time diminishing returns from increasing model size.)
plagiarist 5 days ago
Is there a size cutoff where you would say diminishing returns really kick in? My experience doesn't disagree, at least. I've been using Qwen for coding locally a bit. It is much better than I thought it would be, but it still falls short in some obvious ways compared to the frontier models.