Remix.run Logo
DrJokepu 11 hours ago

But how do they scale the reviewing of the agentic output? Or they just blindly trust it and worst case scenario they get to write a sob story on HN about how Claude has deleted the production db?

noprocrasted 11 hours ago | parent | next [-]

A company can operate aimlessly for a long time and carry along due to inertia and/or monopoly position. So chances are nobody (competent) is reviewing it.

subhobroto 10 hours ago | parent | prev | next [-]

> But how do they scale the reviewing of the agentic output? Or they just blindly trust it and worst case scenario they get to write a sob story on HN about how Claude has deleted the production db?

Thats a fantastic question. Here's my take: https://news.ycombinator.com/item?id=47917314 - would love your thoughts on it.

In short, I think you're asking a billion dollar question - how do we solve the verification, validation, and QA bottleneck?

The way I handle it for my personal projects is I invest tremendous time and effort into writing thorough test and validation suites.

I bet the next billion dollar companies will be those addressing this verification, validation, and QA bottleneck.

queenkjuul 10 hours ago | parent | prev [-]

Have the agents review their own output, obviously. What could go wrong