Remix.run Logo
olalonde 4 hours ago

Would be cool to have a benchmark with actually unsolved math and science questions, although I suspect models are still quite a long way from that level.

gowld an hour ago | parent | next [-]

Does folding a protein count? How about increasing performance at Go?

4 hours ago | parent | prev [-]
[deleted]