Remix.run Logo
user3939382 5 hours ago

“believe” yes in the sense that my program believes x=7. Actually when it goes to read it maybe the bit flipped. Everything on machines is probabilistic that’s a tautology. However we have windowed bounds on valid output, and Claude being able to build a context in which its next decisions are trained on it being an angry vengeful god is not inside that window. That’s what “safe” means, as one of many possible examples.

Inner workings were determined by me, not the LLM. It assisted in generating inputs which had 100% boolean results in the output.