| ▲ | veselin 3 hours ago | ||||||||||||||||
Here, it appears they compare a single prompt "find IDOR", against a multi-agent system. However, one can also start far more sophisticated skills that spin up subagents and mostly do the same in Claude Code, Codex, OpenCode, Pi, etc. Which I guess makes what semgrep sells obsolete. Unless they have built a pareto-optimal point in terms of capabilities and token usage maybe? | |||||||||||||||||
| ▲ | blazespin 3 hours ago | parent [-] | ||||||||||||||||
I think the point is less "how can we throw shade on the OP" and more "a harness can enable a lot of models to do very serious cybersec, glm 5.2 is one of them" | |||||||||||||||||
| |||||||||||||||||