| ▲ | tylervigen 3 hours ago | |
I don’t think the current title (“GPT-5 outperforms federal judges in legal reasoning experiment”) fits. The authors use the title “Silicon Formalism: Rules, Standards, and Judge AI” and explicitly point out that the judges were likely making intentional value judgement calls that drove much of the difference. | ||