S0y 5 hours ago

These are simply benchmaxxed versions of either Qwen or Gemma 4.

2001zhaozhao 3 hours ago | parent | next [-]

If so, it's impressive they managed to benchmaxx Qwen even further than it's already benchmaxxed.

v3ss0n 3 hours ago | parent [-]

Nah, they just put up graphs with different colors that prioritize themselves.

jorisw 4 hours ago | parent | prev [-]

Citation needed

S0y 3 hours ago | parent [-]

Sure. https://deep-reinforce.com/ornith_1_0.html

>Built on top of pretrained Gemma 4 and Qwen 3.5, it achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks.

>Ornith-1.0 is a self-improving training framework. Instead of relying on human-designed harnesses to drive solution generation in RL, Ornith-1.0 learns to generate both solution rollouts and the task-specific harnesses that guide those rollouts.