Remix clone Hacker News

new | show | ask | jobs Github

	▲	atleastoptimal 5 days ago
		Absolutely untrue. Claiming GPT-3 hallucinates as much as o3 over the same token horizon on the same prompts is a silly notion and easily disproven by the dozens of benchmarks. You can code a complete web-app with models now, something far beyond the means of models so long ago.
	▲	otabdeveloper4 5 days ago \| parent [-]
		> caveats and weasel words > "benchmarks" Stop drinking the coolaid and making excuses for LLM limitations, and learn to use the tools properly given their limits instead.