These are simply benchmaxxed versions of either Qwen or Gemma 4.

If so, it's impressive they managed to benchmaxx Qwen even further than it's already benchmaxxed.

	▲	v3ss0n 3 hours ago \| parent [-]
		Nah , they just put graphs with different color prioritizing themselves.

jorisw 4 hours ago | parent | prev [-]

Citation needed

	▲	S0y 3 hours ago \| parent [-]
		Sure. https://deep-reinforce.com/ornith_1_0.html >Built on top of pretrained Gemma 4 and Qwen 3.5, it achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks. >Ornith-1.0 is a self-improving training framework. Instead of relying on human-designed harnesses to drive solution generation in RL, Ornith-1.0 learns to generate both solution rollouts and the task-specific harnesses that guide those rollouts.