Remix clone Hacker News

	▲	solenoid0937 5 hours ago
		https://marginlab.ai/trackers/claude-code-historical-perform...
	▲	taylorfinley 3 hours ago
		Surely they are testing their optimizations against common benchmarks internally? I bet the "real-world task" degradation is larger, by some multiple, than it appears when measured through a benchmark that is itself part of the optimization target.