Remix clone Hacker News

	▲	wren6991 2 days ago
		> you could do substantially the same thing by just prompting/LoRA finetuning the model to produce a single-token output ("yes" or "no")

		You could probably achieve this with logit masking. Or, equivalently, by comparing the "yes" vs "no" logprobs in the final unembedded vector, i.e. the logits at the last position.
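
A rough sketch of why the two approaches agree, assuming you already have the final-position logits as a plain list and know the token ids for "yes" and "no". The ids and logit values below are made up for illustration, and the function names are hypothetical rather than from any particular library. Masking everything except the two tokens and renormalising gives the same P(yes) as a sigmoid over the raw logit difference.

```python
import math

def log_softmax(logits):
    # Numerically stable log-softmax over a plain list of floats.
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

def mask_logits(logits, allowed_ids):
    # Logit masking: every token outside allowed_ids gets -inf,
    # so it receives zero probability after the softmax.
    allowed = set(allowed_ids)
    return [x if i in allowed else float("-inf") for i, x in enumerate(logits)]

def decide_by_masking(logits, yes_id, no_id):
    # Mask, renormalise over the two surviving tokens, pick the larger.
    logprobs = log_softmax(mask_logits(logits, [yes_id, no_id]))
    answer = "yes" if logprobs[yes_id] >= logprobs[no_id] else "no"
    return answer, math.exp(logprobs[yes_id])

def decide_by_margin(logits, yes_id, no_id):
    # Compare the two logits directly. The softmax normaliser cancels,
    # so the logprob difference equals the logit difference.
    margin = logits[yes_id] - logits[no_id]
    answer = "yes" if margin >= 0 else "no"
    return answer, 1.0 / (1.0 + math.exp(-margin))

# Hypothetical 4-token vocabulary; ids 1 and 2 stand in for "yes" and "no".
example_logits = [2.0, 0.5, 1.0, -1.0]
YES_ID, NO_ID = 1, 2

masked_answer, masked_p_yes = decide_by_masking(example_logits, YES_ID, NO_ID)
margin_answer, margin_p_yes = decide_by_margin(example_logits, YES_ID, NO_ID)
```

Note that token 0 has the highest raw logit here, so unconstrained greedy decoding would emit neither "yes" nor "no". Both functions ignore it and return the same answer and probability.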