Remix.run Logo
minimaxir 5 hours ago

Golden Gate Claude was two years ago and it's surprising there hasn't been as much research into targeted activations since.

landl0rd 2 hours ago | parent [-]

There’s been some, but naive activation steering makes models dumber pretty reliably and training an SAE is a pretty heavy lift.