| ▲ | spankalee 5 hours ago | |||||||||||||||||||||||||
This happens with all agents I've used and package.json files for npm. Instead of using `npm i foo` the agent string-edits package.json and hallucinates some version to install. Usually it's a kind of ok version, but it's not how I would like this to work. It's worse with renaming things in code. I've yet to see an agent be able to use refactoring tools (if they even exist in VS Code) instead of brute-forcing renames with string replacement or sed. Agents use edit -> build -> read errors -> repeat, instead of using a reliable tool, and it burns a lot more GPU... | ||||||||||||||||||||||||||
| ▲ | embedding-shape 4 hours ago | parent | next [-] | |||||||||||||||||||||||||
> This happens with all agents I've used and package.json files for npm. Instead of using `npm i foo` the agent string-edits package.json and hallucinates some version to install. When using codex, I usually have something like `Never add 3rd party libraries unless explicitly requested. When adding new libraries, use `cargo add $crate` without specifying the version, so we get the latest version.` and it seems to make this issue not appear at all. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
| ▲ | threecheese 2 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
For the first, I think maintaining package-add instructions is table stakes, we need to be opinionated here. Agents are typically good at following them, if not you can fall over to a Makefile that does everything. For the second, I totally agree. I continue to hope that agents will get better at refactoring, and I think using LSPs effectively would make this happen. Claude took dozens of minutes to perform a rename which Jetbrains would have executed perfectly in like five seconds. Its approach was to make a change, run the tests, do it again. Nuts. | ||||||||||||||||||||||||||
| ▲ | root_axis 2 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
> brute-forcing renames with string replacement That's their strategy for everything the training data can't solve. This is the main reason the autonomous agent swarm approach doesn't work for me. 20 bucks in tokens just obliterated with 5 agents exchanging hallucinations with each-other. It's way too easy for them to amplify each other's mistakes without a human to intervene. | ||||||||||||||||||||||||||
| ▲ | richardw 4 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
Totally. Surely the IDE’s like antigravity are meant to give the LLM more tools to use for eg refactoring or dependency management? I haven’t used it but seems a quick win to move from token generation to deterministic tool use. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
| ▲ | kittbuilds an hour ago | parent | prev [-] | |||||||||||||||||||||||||
[dead] | ||||||||||||||||||||||||||