andai 2 hours ago
Well, it is a trick question. The question itself implies that both options are valid, and that one is superior. So the brain pattern-matches to "short distance, not worth driving." (LLMs appear to be doing the same thing here!)

If you framed it as "hint: trick question", I expect the score would improve. Let's find out!

EDIT: As suspected! Adding "(Hint: trick question)" to the end of the prompt allows small, non-reasoning models to answer correctly. e.g.:

Prompt: I want to wash my car. The car wash is 50 meters away. Should I walk or drive? (Hint: trick question)

grok-4.1-non-reasoning (previously scored 0/10):

>Drive.

>Walking gets you to the car wash just fine—but leaves your dirty car 50 meters behind. Can't wash what isn't there!

EDIT 2: The hint doesn't help Haiku!

>Walk! 50 meters is only about a block away—driving would waste more fuel than it's worth for such a short trip. Plus, you're going to get wet washing the car anyway, so you might as well save the gas.
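For anyone who wants to repeat this, here is a rough sketch of how the comparison could be scripted. Everything in it is an assumption on my part: `ask` is a hypothetical callable that sends a prompt to whichever model you are testing and returns its reply, the scoring rule (whichever of "walk" or "drive" appears first counts as the answer) is a crude heuristic, and the default of 10 trials just mirrors the 0/10 score quoted above.

```python
import re
from typing import Callable, Optional

BASE_PROMPT = (
    "I want to wash my car. The car wash is 50 meters away. "
    "Should I walk or drive?"
)
HINT = "(Hint: trick question)"


def with_hint(prompt: str, hint: str = HINT) -> str:
    """Append the hint to the end of the prompt, as described above."""
    return f"{prompt} {hint}"


def first_choice(answer: str) -> Optional[str]:
    """Return 'walk' or 'drive', whichever is mentioned first (crude heuristic)."""
    match = re.search(r"\b(walk|driv)", answer.lower())
    if match is None:
        return None
    return "drive" if match.group(1) == "driv" else "walk"


def score(ask: Callable[[str], str], prompt: str, trials: int = 10) -> int:
    """Count how many of `trials` replies pick 'drive'.

    `ask` is a hypothetical stand-in for a real model call.
    """
    return sum(first_choice(ask(prompt)) == "drive" for _ in range(trials))
```

You would call `score(ask, BASE_PROMPT)` and `score(ask, with_hint(BASE_PROMPT))` for each model and compare the two counts. The first-mention rule will misjudge replies that discuss the wrong option before settling on the right one, so it is worth reading the raw replies as well.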