Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
nolist_policy
2 days ago
Is distillation or synthetic data used during pre-training? If yes how much?