Remix clone Hacker News

new | show | ask | jobs Github

	▲	vessenes 5 hours ago
		Cool! From the prompt it looks like you don’t give the llms a harness to step through games or simulate - is that correct? If so I’d suggest it’s not a level playing field vs human written bots - if the humans are allowed to watch some games that is.
	▲	levmiseri 5 hours ago \| parent [-]
		That’s true, I’m trying to figure out a better testing environment with a feedback loop. I did try letting the models iterate on the bot code based on a summary of an end-of-game ‘report’, but that showed only marginal improvements vs. zero-shot