Remix.run Logo
jjcm 2 hours ago

This is probably less likely with this model, as it’s almost certainly a further RL training continuation of 3.5 27b. The bugs with this architecture were worked out when that dropped.

originalvichy 2 hours ago | parent [-]

Valuable note!