Remix clone Hacker News

	▲	sathish316 5 hours ago
		Does this imply LLMs will not work well on novel reasoning problems?
	▲	danpalmer 3 hours ago
		Yep, that's the implication. Anecdotally, this is obvious to me. I'm using LLMs to write Java and C++, and they can churn out generic plumbing with no issues, but on novel code for a novel implementation of a novel idea, they have no idea what they're doing. I'm getting good productivity gains, but it requires a lot of hand-holding because the AI does not know what it's doing. On far less novel problems I get far better results.
	▲	wmf 5 hours ago
		ARC-AGI is already testing that.