Remix clone Hacker News

new | show | ask | jobs Github

	▲	cocogoatmain 30 minutes ago
		Want to also add that the model doesn’t know how to respond in a user-> assistant style conversation after it’s pretraining, and it’s a pure text predictor (look at the open source base models) There’s also what is being called mid-training where the model is trained on high(er) quality traces and acts as a bridge between pre and post training