▲ | ilaksh 2 days ago | ||||||||||||||||||||||||||||||||||
I think this was true late 2023 or early 2024, but not necessarily in mid 2025 for most tasks (as long as they require some AI and aren't purely automation) and you use SOTA LLMs. I used to build the way most of his examples are just functions calling LLMs. I found it almost necessary due to poor tool selection etc. But I think the leading edge LLMs like Gemini 2.5 Pro and Claude 4 are smart enough and good enough at instruction following and tool selection that it's not necessarily better to create workflows. I do have a checklist tool and delegate command and may break tasks down into separate agents though. But the advantage of creating instructions and assigning tool commands, especially if you have an environment with a UI where it is easy to assign tool commands to agents and otherwise define them, is that it is more flexible and a level of abstraction above something like a workflow. Even for visual workflows it's still programming which is more brittle and more difficult to dial in. This was not the case 6-12 months ago and doesn't apply if you insist on using inferior language models (which most of them are). It's really only a handful that are really good at instruction following and tool use. But I think it's worth it to use those and go with agents for most use cases. The next thing that will happen over the following year or two is going to be a massive trend of browser and computer use agents being deployed. That is again another level of abstraction. They might even incorporate really good memory systems and surely will have demonstration or observation modes that can extract procedures from humans using UIs. They will also learn (record) procedural details for optimization during exploration from verbal or written instructions. | |||||||||||||||||||||||||||||||||||
▲ | clbrmbr a day ago | parent | next [-] | ||||||||||||||||||||||||||||||||||
I agree that the strongest agentic models (Claude Opus 4 in particular) change the calculus. They still need good context, but damn are they good at reaching for the right tool. | |||||||||||||||||||||||||||||||||||
▲ | bonzini 2 days ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||
The techniques he has in the post are mostly "model your problem as a data flow graph and follow it". If you skip the modeling part and rely on something that you don't control being good enough, that's faith not engineering. | |||||||||||||||||||||||||||||||||||
|