| ▲ | gslepak 8 hours ago | ||||||||||||||||||||||||||||||||||
Note that these are Python-only results, the model will not do as well with other languages. I'm glad to see more domain-focused SLMs, we need more of them! A programming focused MoE should work well across many languages. | |||||||||||||||||||||||||||||||||||
| ▲ | rcarmo an hour ago | parent | next [-] | ||||||||||||||||||||||||||||||||||
If it writes functional Python instead of cosplaying as a Java programmer and cramming code with classes and accessors, it's already better than Opus... | |||||||||||||||||||||||||||||||||||
| ▲ | nsingh2 7 hours ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||
Lots of confusion about what this model is actually focused on. It is a cheap specialist for closed-world, verifiable reasoning tasks like math, self-contained coding problems, and similar. "Closed-world" means the needed information is already in the context. It is not a tool-using agent that can discover missing context. "Verifiable" means answers are hard to generate but easy to check. So no open ended research, repo wide agent work, factual Q&A, or SVG generation. More of a compact reasoning module for bounded problems. | |||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||