Remix clone Hacker News
Confidence estimation is a better metric than agreement for LLM judges (arxiv.org)
3 points by rapiddev 8 hours ago