Remix.run Logo
sgt101 3 hours ago

How to know if one should fine tune/pretrain or RL / reasoning train given some data set?