solenoid0937 11 hours ago
This isn't "risk seeking" in the least, and it's not a security thing. You can always come back later if the design was wrong - you can't undo 7 wasted hours because you didn't notice an agent ask a question at the beginning of the day. A year ago everyone thought that humans should review every agent command before it was executed. Every major company realized that was dumb, and now most enterprise agents run in a sandbox without being babysat. This is just an extension of the same principle. As models get better, you can trust them to make the right architectural choice more often. The likelihood of a human missing a prompt or not wanting to babysit the agent is higher than the likelihood of the agent choosing the wrong option.