nsingh2 7 hours ago
There's a lot of confusion about what this model is actually focused on. It is a cheap specialist for closed-world, verifiable reasoning tasks such as math and self-contained coding problems. "Closed-world" means the needed information is already in the context; it is not a tool-using agent that can discover missing context. "Verifiable" means answers are hard to generate but easy to check. So no open-ended research, repo-wide agent work, factual Q&A, or SVG generation. It's more of a compact reasoning module for bounded problems.
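To make "hard to generate but easy to check" concrete, here is a minimal sketch in Python. It assumes a toy ODE, y'' = -y, chosen only for illustration (it is not a problem from this thread), and the helper names `residual` and `looks_like_solution` are hypothetical. Finding a closed form can take real insight, but checking a candidate only requires substituting it back in and confirming the residual is near zero.

```python
import math

def residual(f, y, x, h=1e-3):
    """Residual of the ODE y'' = f(x, y, y') at x, via central differences."""
    d1 = (y(x + h) - y(x - h)) / (2 * h)
    d2 = (y(x + h) - 2 * y(x) + y(x - h)) / (h * h)
    return d2 - f(x, y(x), d1)

def looks_like_solution(f, y, points, tol=1e-6):
    """Cheap numerical check: the residual is small at every sample point."""
    return all(abs(residual(f, y, x)) < tol for x in points)

# Assumed example ODE (illustrative only): y'' = -y
def rhs(x, y, dy):
    return -y

points = [0.1 * k for k in range(1, 30)]
print(looks_like_solution(rhs, math.sin, points))  # True: sin solves y'' = -y
print(looks_like_solution(rhs, math.exp, points))  # False: exp does not
```

A numerical check like this can only reject bad candidates or raise confidence in good ones; a symbolic substitution would be needed for a proof.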
nsingh2 5 hours ago
To follow up on this, I had it solve a nasty ODE problem that I saw in the recent Mathematica 15 release post:
And surprisingly, it found a valid solution! That's extra impressive because it runs at 25 tok/s on my measly RTX 2070 Super.
Apparently Mathematica 14.3 couldn't solve this ODE.
skeledrew 3 hours ago
If it can code well, then once you put it in a loop with an interpreter, it can do anything.
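A rough sketch of what "a loop with an interpreter" could look like, in Python. Everything here is an assumption for illustration: `generate` stands in for whatever call produces code from the model, and `stub_model` is a hypothetical placeholder that fails once and then succeeds, so the example runs without a model. The loop executes each attempt in a fresh interpreter and feeds any error output back into the next attempt.

```python
import subprocess
import sys

def run_python(code, timeout=10):
    """Run code in a fresh interpreter; return (ok, combined stdout and stderr)."""
    try:
        proc = subprocess.run(
            [sys.executable, "-c", code],
            capture_output=True, text=True, timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        return False, "timed out"
    return proc.returncode == 0, proc.stdout + proc.stderr

def solve(task, generate, max_rounds=3):
    """Ask for code, run it, and retry with the error output as feedback."""
    feedback = ""
    for _ in range(max_rounds):
        code = generate(task, feedback)
        ok, output = run_python(code)
        if ok:
            return output
        feedback = output  # the traceback becomes context for the next attempt
    return None

def stub_model(task, feedback):
    # Hypothetical stand-in for a model call: buggy first, fixed after feedback.
    if not feedback:
        return "print(1 / 0)"
    return "print(sum(range(10)))"

result = solve("sum the integers 0..9", stub_model)
print(result)
```

The loop only treats a clean exit as success, so it is as good as that check; running untrusted generated code like this would also need sandboxing in practice.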