Remix.run Logo
simonw an hour ago

Reasoning models with access to Python have been able to solve 4th grade math homework for over a year now. Prove me wrong: show me a 4th grade math problem they can't handle.

otabdeveloper4 an hour ago | parent [-]

> show me a 4th grade math problem they can't handle

Sure.

"8 7 6 5 4 3 2 1 - add minus signs and parenthesis to get 31."

P.S. There is an answer online and some LLMs will just copy it verbatim. This doesn't count.

simonw 36 minutes ago | parent [-]

Whoa, 4th grade math problems got hard! I'm not sure how I'd tackle that one myself.