▲ | anonymoushn 2 days ago | ||||||||||||||||
"subliminal learning" does not even work for use cases like distilling o1 to R1 because they do not share a base model | |||||||||||||||||
▲ | pyman 2 days ago | parent [-] | ||||||||||||||||
Who's talking about that? [Edit] My bad, I thought I was commenting on Anthropic's article | |||||||||||||||||
|