stolencode 4 days ago
> For example achieving 66.7% on the AIME 2024 dataset.

We worked _really_ hard, burned _tons_ of cash, and we're proud of our D- output. No wonder there are more papers published than actual work being done.
supermdguy 4 days ago
That corresponds to a 10/15, which is actually really good (median is around 6) https://artofproblemsolving.com/wiki/index.php/AMC_historica...
jpcompartir 4 days ago
This is a nonsense critique. Modest results are worth publishing, as are bad results. |