| ▲ | XCSme 7 hours ago | |
Oh, or you meant a smaller model than GLM-5.2 with similar capabilities? | ||
| ▲ | segmondy 5 hours ago | parent | next [-] | |
Probably not. Qwen3.(5|6)-27B seems like an "accidental freak". I'm not even sure they know what they did to create that. A decent amount of the team members left after that, so unfortunately, we might not be seeing another small model that packs such a punch for a while. Hopefully the team is studying their entire training recipe for that and is able to replicate. If they are, then a 50-70B dense model might give us such capabilities... | ||
| ▲ | Pragmata 7 hours ago | parent | prev [-] | |
Yep! I'm running things locally on a RTX5080 + RTX1060 + 64GB DDR5 ram, and would love to get a more capable model if possible! QWEN3.6 27b is pretty good, but i can still notice some spots where it's not as good as the frontier models. | ||