Remix clone Hacker News

new | show | ask | jobs Github

	▲	a24j 3 hours ago
		Can you share the agent-comparison harness code or point to something similar? I want to learn about benchmarking models in a basic or practical sense.