embedding-shape 3 hours ago
> These aren't benchmark items with public answer keys — they're claims real users submitted for verification to a fact-checking platform.

Cool. I wonder if any of this matters when the authors don't disclose exactly how much of their report was written and produced with LLMs in the first place? There is even an "11. Ethics & data use" section, and the research is about LLMs being infallible in some ways, yet the use of LLMs in producing this report isn't mentioned even once.
kostaj 3 hours ago
Data collection and processing were done manually. LLMs helped with drafting the report. Everything was human-reviewed before publishing.