| ▲ | Xx_crazy420_xX 6 hours ago | |||||||
Autoresearch is nothing new, big players are already in the game with more sophisticated solutions:
The mostly used benchmark for automated AI engineering/ research is:
https://github.com/openai/mle-bench | ||||||||
| ▲ | bluequbit 2 hours ago | parent [-] | |||||||
The thing is, autoresearch feels more accessible that the listed solutions. I can use it trivially on virtually any problem that has verifiable rewards and a feedback loop. | ||||||||
| ||||||||