emp17344 5 hours ago
Hold your horses, that’s a long way off. The best math AI tool we currently have, Aletheia, solved only 13 of the 700 open Erdős problems it attempted, and only 4 of those autonomously: https://arxiv.org/html/2601.22401v3 Clearly, these models still struggle with novel problems.
slibhb 4 hours ago
> Clearly, these models still struggle with novel problems.

Do they struggle with novel problems more or less than humans?