o10449366 3 hours ago
[flagged]
minimaxir 3 hours ago
"Quirky and obscure" has the functional benefit of ensuring the source question is not in the training data and falls outside the median user prompt, which makes the model less likely to cheat. We have enough people complaining about Simon Willison's pelican test.
Bjartr 2 hours ago
What would make the prompt a better actual evaluation, in your judgment?
tailscaler2026 2 hours ago
still #opentowork huh
codemog 3 hours ago
Ah yes, also known as C++ enjoyers.