Remix clone Hacker News

new | show | ask | jobs Github

	▲	skhameneh 7 days ago
		I was talking to an old colleague/friend about distillation, trying to understand how to steer distillation with regards to removing irrelevant regions of a larger model when training a smaller model. He shared this paper with me, calling the works seminal, it appears to be highly relevant: Inference-Time Intervention: Eliciting Truthful Answers from a Language Model https://arxiv.org/pdf/2306.03341