sigmoid10 4 days ago
As the other commenter already pointed out, I'll believe it when I see it on the leaderboard. But even then it already lost twice against the winner of last year's competition, because that too was a general-purpose LLM that could also do other things.
bubblyworld 4 days ago
Let's not move the goalposts here =) I don't think it's really fair to compare them directly like that. But I agree, this is triggering my "too good to be true" reflex very hard.