Remix clone Hacker News

new | show | ask | jobs Github

	▲	kraakf06 3 hours ago
		False positives like this are probably more damaging than the guardrails themselves. If engineers can't predict when a model will switch behavior, it becomes difficult to trust it in production workflows.
	▲	catlifeonmars 29 minutes ago \| parent [-]
		> “trust it in production workflows” What degree of predictability is required? I imagine the bar is pretty low if you trust the previous models in the same contexts.