| ▲ | v_CodeSentinal 4 hours ago | |
This is the classic 'plausible hallucination' problem. In my own testing with coding agents, we see this constantly—LLMs will invent a method that sounds correct but doesn't exist in the library. The only fix is tight verification loops. You can't trust the generative step without a deterministic compilation/execution step immediately following it. The model needs to be punished/corrected by the environment, not just by the prompter. | ||
| ▲ | SubiculumCode an hour ago | parent | next [-] | |
Honestly, I feel humans are similar. It's the generator <-> executive loop that keeps things right | ||
| ▲ | zoho_seni 4 hours ago | parent | prev [-] | |
I've been using codex and never had a compile time error by the time it finishes. Maybe add to your agents to run TS compiler, lint and format before he finish and only stop when all passes. | ||