| ▲ | tombert 4 hours ago | |
It's definitely gotten considerably better, though I still have issues with it generating proofs, at least with TLAPS. I think behind the scenes it's phoning Wolfram Alpha nowadays for a lot of the numeric and algebraic stuff. For all I know, they might even have an Isabelle instance running for some of the even-more abstract mathematics. I agree that this is largely an early ChatGPT problem though, I just thought it was interesting in that they were "plausible" mistakes. I could totally see twelve-year-old tombert making these exact mistakes, so I thought it was interesting that a robot is making the same mistakes an amateur human makes. | ||
| ▲ | tptacek 4 hours ago | parent | next [-] | |
I assumed it was just writing SymPy or something. | ||
| ▲ | CamperBob2 2 hours ago | parent | prev [-] | |
I think behind the scenes it's phoning Wolfram Alpha nowadays for a lot of the numeric and algebraic stuff. For all I know, they might even have an Isabelle instance running for some of the even-more abstract mathematics. Maybe, but they swear they didn't use external tools on the IMO problem set. | ||