| ▲ | in-silico 8 hours ago | |
Additionally, maybe it's easier for a model to realize that it doesn't know the answer when the question is easier. If Opus gets all but the hardest questions right, it might have a higher hallucination rate because the questions it gets wrong are the questions where verification or hallucination detection are the most difficult | ||