▲ | easygenes 3 days ago | ||||||||||||||||||||||||||||||||||
Other benchmark aggregates are less favorable to GPT-OSS-120B: https://arxiv.org/abs/2508.12461 | |||||||||||||||||||||||||||||||||||
▲ | petesergeant 3 days ago | parent [-] | ||||||||||||||||||||||||||||||||||
With all these things, it depends on your own eval suite. gpt-oss-120b works as well as o4-mini over my evals, which means I can run it via OpenRouter on Cerebras where it's SO DAMN FAST and like 1/5th the price of o4-mini. | |||||||||||||||||||||||||||||||||||
|