Remix.run Logo
SubiculumCode 3 days ago

I've looked up hallucination eval leaderboards, and there doesn't seem to be much besides the vectara [1][2], which doesnt seem to include Claude, and seems to be missing Gemni Pro (non-experimental).

[1] https://huggingface.co/spaces/vectara/leaderboard [2] https://github.com/vectara/hallucination-leaderboard/tree/ma...