Remix clone Hacker News

	▲	energy123 2 days ago
		I agree it isn't the main property we care about; what we care about is reliability. But at least in its theoretical construction, the LLM should be deterministic: for a given input it outputs a fixed probability distribution across tokens, with no RNG involvement. We then either sample from that fixed distribution non-deterministically for better performance, or use greedy decoding and get slightly worse performance in exchange for full determinism. Happy to be corrected if I'm wrong about something.
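
		To make the distinction concrete, here is a minimal toy sketch, not a real LLM. The logits are made-up numbers standing in for a model's output at one step, and `softmax`, `greedy` and `sample` are hypothetical helper names. It only illustrates that the distribution is a pure function of the logits, and that randomness enters at the sampling step.

```python
import math
import random


def softmax(logits):
    """Turn logits into a probability distribution. Pure function, no RNG."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def greedy(probs):
    """Greedy decoding: always take the most likely token index."""
    return max(range(len(probs)), key=probs.__getitem__)


def sample(probs, rng):
    """Stochastic decoding: draw one token index from the fixed distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Hypothetical logits for a 4-token vocabulary (an assumption for illustration).
logits = [2.0, 1.0, 0.5, -1.0]
probs = softmax(logits)

# Greedy decoding gives the same token on every call.
greedy_picks = {greedy(probs) for _ in range(100)}

# Sampling with an unseeded RNG can give different tokens across calls.
sampled_picks = [sample(probs, random.Random()) for _ in range(100)]

print("greedy picks:", greedy_picks)
print("distinct sampled picks:", sorted(set(sampled_picks)))
```

		In this toy setup, fixing the RNG seed also makes the sampled output repeatable, which is another way of saying that the only source of randomness here is the sampler, not the distribution.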