Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
sgt101
3 hours ago
How to know if one should fine tune/pretrain or RL / reasoning train given some data set?