Remix.run Logo
o10449366 3 hours ago

[flagged]

minimaxir 3 hours ago | parent | next [-]

"Quirky and obscure" has the functional benefit of ensuring the source question is not in the training data/outside the median user prompt, and therefore making the model less likely to cheat.

We have enough people complaining about Simon Willison's pelican test.

Bjartr 2 hours ago | parent | prev | next [-]

What would make the prompt a better actual evaluation in your judgement?

tailscaler2026 2 hours ago | parent | prev | next [-]

still #opentowork huh

beepbooptheory an hour ago | parent [-]

Where does one even use that hashtag?

codemog 3 hours ago | parent | prev [-]

Ah yes, also known as C++ enjoyers.