Perfect match for this test: https://arxiv.org/abs/2602.05192
Heres the result [1]
[1] https://www.scientificamerican.com/article/first-proof-is-ai...
This is what everyone who uses llms regularly expected. Good results require a human in the loop and the internet is so big that just about everything has been done there by someone. Most often you.