| ▲ | yorwba 7 hours ago | |
You could certainly bolt GRAM onto an LLM, but that won't magically improve its reasoning. It's a special-purpose design for constraint-satisfaction problems with simple rules, but complex interactions. E.g. when solving a Sudoku, the set of valid choices at every step is easy to determine, but you could make a series of valid choices that back you into a corner where no more progress is possible and you have to backtrack. Meanwhile, LLM reasoning failures are more often of the kind where a choice is clearly invalid (as judged by a human observer), but the LLM picks it anyway, because the underlying rule is complex and context-dependent and the model only learned an imperfect approximation that often breaks down. GRAM won't help with that. | ||
| ▲ | ACCount37 2 hours ago | parent [-] | |
My vision for what might happen: an LLM emits a "neural constraint satisfaction task" in latent space, kicks a "neural tool call" into a non-LLM architecture, runs that architecture, gets a latent answer back, attends to the answer to generate better text answers for problems that benefit from improved constraint-satisfaction. But that's a very hard thing to implement, and the gains are uncertain. Thus "might". | ||