Remix.run Logo
orbital-decay 2 hours ago

Training on the CoT itself is pretty dubious since it's reward hacked to some degree (as evident from e.g. GLM-4.7 which tried pulling that with 3.0 Pro, and ended up repeating Model Armor injections without really understanding/following them). In any case they aren't trying to hide it particularly hard.

FergusArgyll 2 hours ago | parent [-]

> In any case they aren't trying to hide it particularly hard.

What does that mean? Are you able to read the raw cot? how?