| ▲ | artemisart 5 hours ago | |
Hill climbing doesn't mean much, but it absolutely doesn't imply they cheat on benchmarks. They have more details here: https://microsoft.ai/news/introducing-mai-thinking-1/ (it seems to be "RL on everything"). | ||