▲ | daxfohl a day ago | |||||||||||||
Like I said, they can implement the algorithm to solve it, but when forced to maintain the state themselves, either internally or explicitly in the context, they are unable to do so and get lost. Similarly if you ask to write a Sudoku solver, they have no problem. And if you ask an online model to solve a sudoku, it'll write a sudoku solver in the background and use that to solve it. But (at least the last time I tried, a year ago), if you ask to solve step-by-step using pure reasoning without writing a program, they start spewing out all kinds of nonsense (but humorously cheat: they'll still spit out the correct answer at the end). | ||||||||||||||
▲ | prewett 18 hours ago | parent | next [-] | |||||||||||||
That’s because there are lots of maze-solving algorithms on the web, so it’s easy to spit one back at you. But since they don’t actually understand how solve a maze, or even apply an algorithm one step at a time, it doesn’t work well. | ||||||||||||||
▲ | warrenm 19 minutes ago | parent | prev | next [-] | |||||||||||||
you do not need to remember state with the simplest solver: - place your right hand on the right wall - walk forward, never letting your hand leave the wall - arrive at the exit yes, you travel many dead ends along the way but you are guaranteed to get to the end of a 'traditional' maze | ||||||||||||||
▲ | adventured a day ago | parent | prev [-] | |||||||||||||
So if you push eg Claude Sonnet 4 or Opus 4.1 into a maze scenario, and have it record its own pathing as it goes, and then refresh and feed the next Claude the progress so far, would that solve for the inability to maintain long duration context in such maze cases? I make Claude do that on every project. I call them Notes for Future Claude and have it write notes for itself because of how quickly context accuracy erodes. It tends to write rather amusing notes to itself in my experience. | ||||||||||||||
|