|
| ▲ | scott_w 2 hours ago | parent | next [-] |
| But you're ascribing understanding to the LLM, which is not what it's doing. If the LLM understood you, it would realise it's a trick question and, assuming it was British, reply with "You'd drive it because how else would you get it to the car wash you absolute tit." Even the higher level reasoning, while answering the question correctly, don't grasp the higher context that the question is obviously a trick question. They still answer earnestly. Granted, it is a tool that is doing what you want (answering a question) but let's not ascribe higher understanding than what is clearly observed - and also based on what we know about how LLMs work. |
| |
| ▲ | layer8 an hour ago | parent [-] | | > They still answer earnestly. Gemini at least is putting some snark into its response: “Unless you've mastered the art of carrying a 4,000-pound vehicle over your shoulder, you should definitely drive. While 150 feet is a very short walk, it's a bit difficult to wash a car that isn't actually at the car wash!” | | |
| ▲ | Barbing 15 minutes ago | parent [-] | | Marketing plan comes to mind for labs: find AI tells, fix them, & astroturf on socials that only _your_ frontier model reallly understands the world |
|
|
|
| ▲ | DharmaPolice 2 hours ago | parent | prev | next [-] |
| I think a good rule of thumb is to default to assuming a question is asked in good faith (i.e. it's not a trick question). That goes for human beings and chat/AI models. In fact, it's particularly true for AI models because the question could have been generated by some kind of automated process. e.g. I write my schedule out and then ask the model to plan my day. The "go 50 metres to car wash" bit might just be a step in my day. |
| |
| ▲ | vintermann an hour ago | parent [-] | | Rule of thumb for who, humans or chatbots? For a human, who has their own wants and values, I think it makes perfect sense to wonder what on earth made the interlocutor ask that. |
|
|
| ▲ | layer8 an hour ago | parent | prev [-] |
| Therefore the correct response would be to inquire back to clarify the question being asked. |