Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
epolanski
9 hours ago
I think that they are simply evaluated on prompt to solution benchmarks.