Remix.run Logo
AI agents still can't solve 1/3 of SWE-Bench problems. Why not? (A Case Study)(surgehq.ai)
1 points by egilliehhc 7 hours ago | 1 comments
TesterVetter 7 hours ago | parent [-]

[dead]