| ▲ | upcoming-sesame an hour ago | |
I also look for this reply because i like seeing the follow-up reply saying that this is not a benchmark anymore because labs have gotten it in their training data. that reply never failed to come it's basically a meme at this point | ||