| ▲ | gobdovan 2 hours ago | |
Well, the problem is that current LLMs are stateless, so a thousand subjective years is not well-defined. Without continuity of experience, persistent memory, engineered aversive stimuli and without updating weights meaninguflly during the punishment interval, we are merely doing the equivalent of simply updating a model to believe it just suffered a thousand years. Only once we have all these right ingredients we can empirically determine whether a thousand years is excessive, insufficient, or the local optimum for reducing Claude overwriting that damn CSS color palette. | ||