Remix.run Logo
sinuhe69 4 hours ago

I'm pretty certain that DeepMind (and all other labs) will try their frontier (and even private) models on First Proof [1].

And I wonder how Gemini Deep Think will fare. My guess is that it will get half the way on some problems. But we will have to take an absence as a failure, because nobody wants to publish a negative result, even though it's so important for scientific research.

[1] https://1stproof.org/

zozbot234 4 hours ago | parent [-]

The 1st proof original solutions are due to be published in about 24h, AIUI.