Remix clone Hacker News

new | show | ask | jobs Github

	▲	ChadNauseam 3 days ago
		> Anthropic claims Mythos is in a class of its own, the evidence corroborates this and the government believes it. They didn't release Mythos, they released Fable, which was Mythos + a classifier that detected potentally-dangerous prompts and blocked them. Everyone who used it noticed how aggressive the classifier was. It would trigger constantly over totally innocent stuff.
	▲	catigula 3 days ago \| parent [-]
		A classifier that was exposed as non-efficacious for a product touted as having extremely dangerous capabilities. I can generate hacks trivially by asking any model to fix open source code. Let’s not pretend you get to have your cake and eat it too.