cratermoon 2 days ago

https://thebullshitmachines.com/lesson-16-the-first-step-fal...

Philpax 2 days ago

This doesn't seem to really address synthetic data, let alone RL-based reasoning.

cratermoon 2 days ago

Why would it? Once those are introduced, advancement leaves behind pure scaling.