Remix clone Hacker News

new | show | ask | jobs Github

▲

m3kw9 2 hours ago

FrontierCode is likely paid for by anthropic.

▲

lanthissa 2 hours ago | parent | next [-]

did they not pay them enough to get good ratings on the other 3 models?

whats the logic in claiming its a borked metric when everything listed is an anthropic model.

	▲	Narretz 2 hours ago \| parent [-]
		There a few benchmarks out there where all existing models have abysmal scores. So it's not actually a problem if Antrophic's older models are bad, especially if the jump to the newest model is huge, and the competition is also way below it.

▲

reasonableklout 2 hours ago | parent | prev [-]

Huh? It's a benchmark by Cognition which (1) is building their own models and (2) offers all providers and thus has an incentive to avoid hyping up any one too much.

	▲	jstummbillig 2 hours ago \| parent [-]
		But you can just say shit now. Tokens might not be too cheap to meter but saying shit increasingly is.