| ▲ | PashaGo 2 days ago | |
It would be nice to see some metrics. I think the missing layer here is evaluation. If agents are going to produce applications, the platform needs not only guardrails, but public-ish evidence that those guardrails actually catch failures | ||
| ▲ | owulveryck a day ago | parent | next [-] | |
I fully agree | ||
| ▲ | raicursive 2 days ago | parent | prev [-] | |
[dead] | ||