Remix.run Logo
nolist_policy 2 days ago

Is distillation or synthetic data used during pre-training? If yes how much?