Remix clone Hacker News
Measuring AI Ability to Complete Long Tasks (2x every 7 months)
(metr.org)
2 points by tmoertel 10 hours ago