| ▲ | tzury 20 hours ago | |
This case study reveals the future of AI-assisted[1] work, far beyond mathematics. It relies on a combination of Humans, LLMs ('General Tools'), Domain-Specific Tools, and Deep Research. It is apparent that the static data encoded within an LLM is not enough; one must re-fetch sources and digest them fresh for the context of the conversation. In this workflow, AlphaEvolve, Aristotle, and LEAN are the 'PhDs' on the team, while the LLM is the Full Stack Developer that glues them all together. [1] If one likes pompous terms, this is what 'AGI' will actually look like. | ||
| ▲ | philipwhiuk 15 hours ago | parent | next [-] | |
Aristotle is already an LLM and LEAN combined. [from the Aristotle paper] > Aristotle integrates three main components: a Lean proof search system, an informal reasoning system that generates and formalizes lemmas, and a dedicated geometry solver. [from elsewhere on how part 2 works] > To address IMO-level complexity, Aristotle employs a natural language module that decomposes hard problems into lists of informally reasoned lemmas. This module elicits high-level proof sketches and supporting claims, then autoformalizes them into Lean for formal proving. The pipeline features iterative error feedback: Lean verification errors are parsed and fed back to revise both informal and formal statements, iteratively improving the formalization and capturing creative auxiliary definitions often characteristic of IMO solutions. | ||
| ▲ | 9u127v89yik2j3 17 hours ago | parent | prev [-] | |
The author is the PhD on the team. Literally not AGI. | ||