[flagged]

minimaxir 3 hours ago | parent | next [-]

"Quirky and obscure" has the functional benefit of ensuring the source question is not in the training data and falls outside the median user prompt, which makes the model less likely to cheat.

We have enough people complaining about Simon Willison's pelican test.

Bjartr 2 hours ago | parent | prev | next [-]

What would make the prompt a better actual evaluation in your judgement?

tailscaler2026 2 hours ago | parent | prev | next [-]

still #opentowork huh

	beepbooptheory an hour ago | parent [-]

	Where does one even use that hashtag?

codemog 3 hours ago | parent | prev [-]

Ah yes, also known as C++ enjoyers.