| ▲ | regularfry 2 days ago | |||||||
Thinking vs non-thinking. There'll be a token cost there. But still fairly remarkable! | ||||||||
| ▲ | DoctorOetker 2 days ago | parent [-] | |||||||
Is there a reason we can't use thinking completions to train non-thinking? i.e. gradient descent towards what thinking would have answered? | ||||||||
| ||||||||