Remix.run Logo
Xx_crazy420_xX 6 hours ago

Autoresearch is nothing new, big players are already in the game with more sophisticated solutions:

  - https://arxiv.org/abs/2602.02660 (MARS)
  - https://arxiv.org/abs/2601.14525 (Execution-grounded automated AI research)
  - https://arxiv.org/abs/2601.10402 (ML-Master 2.0)
The mostly used benchmark for automated AI engineering/ research is: https://github.com/openai/mle-bench
bluequbit 2 hours ago | parent [-]

The thing is, autoresearch feels more accessible that the listed solutions. I can use it trivially on virtually any problem that has verifiable rewards and a feedback loop.

baxtr an hour ago | parent [-]

People underestimate UX and accessibility. The iPhone was nothing new.