▲ | CJefferson 4 days ago | ||||||||||||||||
As someone who has written and graded a lot of University exams, I'm sure a decent number of students would write the wrong answer to that. A bunch of students would write 5 (adding all the numbers). Others would write "3 apples and 2 cats", which is technically not what I'm looking for (but personally I would give full marks for, some wouldn't). Many students clear try to answer exams by pattern matching, and I've seen a lot of exams of students "matching" on a pattern based on one word on a question and doing something totally wrong. | |||||||||||||||||
▲ | jonathanlydall 4 days ago | parent | next [-] | ||||||||||||||||
Many professionals with lower skilled jobs sometimes lean too heavily on pattern matching too. For example, customer service reps tend to often vaguely match your request with a possibly or only vaguely applicable templated response. Technically savvy customers who tend to try explain problems in detail are probably more likely to get an actually non-applicable canned response as the CS rep gets frustrated with the amount of information and will latch onto the first phrase which relates to a templated response without really considering context. My reply’s getting a little tangential now, but I feel this is good life advice, I’ve found I’m more likely to get decent customer service if I keep my requests as short as possible. The first sentence needs to essentially state the issue I need help with. In some cases a bulleted list of things I’ve tried helps and then I’m sure to include essential info like an account number, e.g. I’m getting error 13508 when I try log into my account. I’ve already tried the following solutions with no success: - Clearing my browser cache and cookies. - Restarting my computer. - Running all software updates. My account number: xxx What is the next step here? | |||||||||||||||||
| |||||||||||||||||
▲ | jaccola 4 days ago | parent | prev | next [-] | ||||||||||||||||
Parents whole point is contrary to this (they agree with you), the context didn't even include numbers to pattern match on! | |||||||||||||||||
| |||||||||||||||||
▲ | kazinator 4 days ago | parent | prev | next [-] | ||||||||||||||||
When you try wing your way through a question by pattern matching, then you are not applying intelligence. Your interests lie elsewhere and so you are just fumbling your way through the activity at hand just to get through it. | |||||||||||||||||
| |||||||||||||||||
▲ | viccis 4 days ago | parent | prev [-] | ||||||||||||||||
I agree that poor test takers are easily distracted, and this is the reason that "word problems" are heavily emphasized in preparation for tests like the SAT or state proficiency exams. But in general I do not think these models are claiming at being good at replicating the performance of a distracted or otherwise low performing pupil. I think they should be evaluated against humans who are capable of completing word problems containing context that is not inherently necessary to the math question. The reason those tests I mentioned use these word problems is that it's a way to evaluate someone's ability to think in abstract mathematical terms about everyday situations, which obviously involve lots of unimportant information the person must choose to consider or not. tl;dr: I think a reasonably competent high school student could answer the apple and cat question, which is absolutely a reasonable bar for an LLM to clear. If university students are failing these questions, then they have not been taught test taking skills, which should be considered a mathematical failure just as unacceptable as that of the LLM, not a mitigating similarity for the latter. |