| ▲ | gordonhart 6 hours ago | |||||||
Modern reasoning models are actually pretty good at arithmetic and almost certainly would have caught this error if asked. Source: we benchmark this sort of stuff at my company and for the past year or so frontier models with a modest reasoning budget typically succeed at arithmetic problems (except for multiplication/division problems with many decimal places, which this isn't). | ||||||||
| ▲ | RobotToaster 5 hours ago | parent [-] | |||||||
Interesting, how have you found they have been performing at more complex things like calculus and analysis? | ||||||||
| ||||||||