| ▲ | nearbuy 5 hours ago | ||||||||||||||||
I assume they meant they can't come up with a reasonable justification. | |||||||||||||||||
| ▲ | hyperpape 4 hours ago | parent | next [-] | ||||||||||||||||
Thank you, that's correct. To be perfectly clear, I understand their justification for only _editing_ the executive summary, it is arguably reasonable, because editing the work history would risk altering the details in ways that compromise the measurement. This is a hard problem to solve (you might try reviewing the resumes for hallucinations, but I can't think of a precise study design that doesn't risk problems). What is, imho, impossible to defend, is having the LLM only evaluate the executive summary in isolation, and reporting that as it preferring resumes it wrote. What you've shown is that LLMs prefer executive summaries they wrote. But the overall impact on how they will evaluate your entire resume is not measured by this technique. Worse, this isn't just "decent paper, bad summary", their abstract misreports their findings. | |||||||||||||||||
| |||||||||||||||||
| ▲ | delusional 5 hours ago | parent | prev [-] | ||||||||||||||||
I doubt it since they, admittedly, didn't read it. The question he posed, about the paper, is answered in that very same paper. He has structured his whole reply to have the tone of uncovering the hidden caveat in the small print that invalidates the paper, when it's actually a straightforwardly stated assumption in their methodology section. | |||||||||||||||||
| |||||||||||||||||