sathish316 5 hours ago
Does this imply LLMs will not work well on novel reasoning problems?
danpalmer 3 hours ago
Yep, that's the implication. Anecdotally, this is obvious to me. I'm using LLMs to write Java and C++, and they can churn out generic plumbing with no issues, but on novel code for a novel implementation of a novel idea, they have no idea what they're doing. I'm getting good productivity gains, but it requires a lot of hand-holding because the AI does not know what it's doing. On far less novel problems I get far better results.
wmf 5 hours ago
ARC-AGI is already testing that.