| ▲ | iammjm 7 hours ago | |||||||||||||
Why doesn’t it just call tools such as Mathematica for such operations? | ||||||||||||||
| ▲ | ACCount37 4 hours ago | parent | next [-] | |||||||||||||
For the same reason you don't run "4+6" on a calculator. External tool call has an overhead. It requires a round trip into an external tool. It requires an LLM to run in agentic autoregression - it can't be used in prefill. Which means that having native arithmetic capabilities is useful. Forward pass arithmetics are an LLM version of quick mental math. An LLM can read "#define SILLY_TIME_CONST (3*20*60*60*1000)" and have "SILLY_TIME_CONST is 60 h expressed as 216000000 ms" already cached by the end of the line, before it even emits its first token. | ||||||||||||||
| ||||||||||||||
| ▲ | defrost 6 hours ago | parent | prev | next [-] | |||||||||||||
This is more how an LLM thinks about math internally - an LLM version of drilled tables being used for mental arithmetic "as humans do". When humans stall on these tasks, they reach for pen and paper, a slide rule, a calculator, etc. Mathematica is overkill for arithmetic, in addition it's licenced and can cost a bit extra. If an LLM were to reach for a light cheap arithmetic tool something like bc would be a good first stop - a CLI tool with a language that supports arbitrary precision numbers with interactive execution of statements. | ||||||||||||||
| ▲ | jampekka 5 hours ago | parent | prev | next [-] | |||||||||||||
They do. I asked CharGPT for 327 x 48 and it used the "ChatGPT Instruments" calculator. Previously it used to run Python scripts, and may still do for more complex calculations. | ||||||||||||||
| ▲ | breezybottom 2 hours ago | parent | prev [-] | |||||||||||||
ChatGPT does, and has since 2023 | ||||||||||||||