Remix.run Logo
_puk an hour ago

> So maybe the AI labs have been paying attention after all!

> I think this mainly demonstrates that the pelican on the bicycle has firmly exceeded its limits as a useful benchmark.

As acknowledged in the article.

kzrdude 15 minutes ago | parent [-]

Gemini 3.1 basically takes it home on that benchmark, anyway, it's done.