| ▲ | DrJokepu 11 hours ago | |
But how do they scale the reviewing of the agentic output? Or they just blindly trust it and worst case scenario they get to write a sob story on HN about how Claude has deleted the production db? | ||
| ▲ | noprocrasted 11 hours ago | parent | next [-] | |
A company can operate aimlessly for a long time and carry along due to inertia and/or monopoly position. So chances are nobody (competent) is reviewing it. | ||
| ▲ | subhobroto 10 hours ago | parent | prev | next [-] | |
> But how do they scale the reviewing of the agentic output? Or they just blindly trust it and worst case scenario they get to write a sob story on HN about how Claude has deleted the production db? Thats a fantastic question. Here's my take: https://news.ycombinator.com/item?id=47917314 - would love your thoughts on it. In short, I think you're asking a billion dollar question - how do we solve the verification, validation, and QA bottleneck? The way I handle it for my personal projects is I invest tremendous time and effort into writing thorough test and validation suites. I bet the next billion dollar companies will be those addressing this verification, validation, and QA bottleneck. | ||
| ▲ | queenkjuul 10 hours ago | parent | prev [-] | |
Have the agents review their own output, obviously. What could go wrong | ||