Remix clone Hacker News

new | show | ask | jobs Github

	▲	hyperpape 4 hours ago
		Thank you, that's correct. To be perfectly clear, I understand their justification for only _editing_ the executive summary, it is arguably reasonable, because editing the work history would risk altering the details in ways that compromise the measurement. This is a hard problem to solve (you might try reviewing the resumes for hallucinations, but I can't think of a precise study design that doesn't risk problems). What is, imho, impossible to defend, is having the LLM only evaluate the executive summary in isolation, and reporting that as it preferring resumes it wrote. What you've shown is that LLMs prefer executive summaries they wrote. But the overall impact on how they will evaluate your entire resume is not measured by this technique. Worse, this isn't just "decent paper, bad summary", their abstract misreports their findings.
	▲	delusional an hour ago \| parent [-]
		> Worse, this isn't just "decent paper, bad summary", their abstract misreports their findings. What findings are being misrepresented? Their claims seem supported by their conclusions to me. You can question the generality of their claims based on the limitation of their methods, but that does not amount to "misreporting" the conclusion.