| |
| ▲ | segmondy 5 hours ago | parent | next [-] | | Probably not. Qwen3.(5|6)-27B seems like an "accidental freak". I'm not even sure they know what they did to create that. A decent amount of the team members left after that, so unfortunately, we might not be seeing another small model that packs such a punch for a while. Hopefully the team is studying their entire training recipe for that and is able to replicate. If they are, then a 50-70B dense model might give us such capabilities... | |
| ▲ | Pragmata 7 hours ago | parent | prev [-] | | Yep! I'm running things locally on a RTX5080 + RTX1060 + 64GB DDR5 ram, and would love to get a more capable model if possible! QWEN3.6 27b is pretty good, but i can still notice some spots where it's not as good as the frontier models. |
|