Remix clone Hacker News

new | show | ask | jobs Github

	▲	ttul 5 hours ago
		DeepSWE “feels” like the right benchmark in comparison to Artificial Analysis indices and other coding benchmarks. And by their metrics, GPT-5.5 is still king in token efficiency, speed, and overall intelligence per dollar. https://deepswe.datacurve.ai/ Fable 5 is cool and all, but we have not yet seen GPT-5.6.