Remix clone Hacker News

new | show | ask | jobs Github

	▲	in-silico 8 hours ago
		Additionally, maybe it's easier for a model to realize that it doesn't know the answer when the question is easier. If Opus gets all but the hardest questions right, it might have a higher hallucination rate because the questions it gets wrong are the questions where verification or hallucination detection are the most difficult