CamperBob2 2 days ago

Makes for a fascinating principal/agent problem: which role is the LLM playing? If I just tell it "Try different things until you solve the game", it tries to do just that until it hits the limit of 15 tool calls.

alecf 2 days ago | parent [-]

Yeah, made me wonder if you could speedrun the game by giving it a lot of complex instructions and then just letting it run...

cloudfudge 2 days ago | parent [-]

It ran for a while when I gave it instructions to do a depth-first search of the known map, while observing any atypical features of every new location and also picking up anything of note. A few times, it asked me if I wanted to continue the search, but I finally told it not to interrupt the search until it had exhausted all new options, which made it run until it said it had reached the maximum number of tool calls (15).
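For anyone curious what that kind of instruction amounts to, here is a rough sketch of a depth-first search over a map that stops when a tool-call budget runs out. Only the cap of 15 comes from the thread. The `look` callback, the `explore` function, and the toy map are hypothetical stand-ins, not the game's or the LLM's actual tool interface.

```python
from typing import Callable, Dict, List, Tuple

MAX_TOOL_CALLS = 15  # the cap reported in the thread

# Hypothetical toy map: room -> (notes on anything unusual, list of exits).
TOY_MAP: Dict[str, Tuple[str, List[str]]] = {
    "hall": ("nothing odd", ["kitchen", "cellar"]),
    "kitchen": ("a brass key", ["hall"]),
    "cellar": ("a locked door", ["hall", "vault"]),
    "vault": ("", []),
}


def explore(
    start: str,
    look: Callable[[str], Tuple[str, List[str]]],
    max_calls: int = MAX_TOOL_CALLS,
) -> Tuple[List[Tuple[str, str]], bool]:
    """Depth-first search where each call to look(room) costs one tool call.

    Returns (log, finished). finished is False if the budget ran out
    while there were still unvisited rooms on the stack.
    """
    visited = set()
    log: List[Tuple[str, str]] = []
    stack = [start]
    calls = 0
    while stack:
        room = stack.pop()
        if room in visited:
            continue
        if calls >= max_calls:
            return log, False  # budget exhausted, search incomplete
        notes, exits = look(room)  # assumed: one tool call per room
        calls += 1
        visited.add(room)
        log.append((room, notes))
        # Push exits in reverse so the first listed exit is explored first.
        for nxt in reversed(exits):
            if nxt not in visited:
                stack.append(nxt)
    return log, True


log, finished = explore("hall", lambda room: TOY_MAP[room])
for room, notes in log:
    print(room, "-", notes)
print("finished:", finished)
```

On a map with more than 15 reachable rooms this returns `finished == False`, which is roughly the behavior described above: the search itself is fine, it just hits the cap before exhausting the options.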