Remix clone Hacker News

new | show | ask | jobs Github

	▲	cwyers 9 days ago
		So, the way speculative decoding works, the model begins predicting at the first wrong token, so you still get 'is' for free.