Remix clone Hacker News

new | show | ask | jobs Github

	▲	ainch 7 hours ago
		I would expect to see a significant wall clock improvement if that was the case - Meta's Coconut paper was ~3x faster than tokenspace chain-of-thought because latents contain a lot more information than individual tokens. Separately, I think Anthropic are probably the least likely of the big 3 to release a model that uses latent-space reasoning, because it's a clear step down in the ability to audit CoT. There has even been some discussion that they accidentally "exposed" the Mythos CoT to RL [0] - I don't see how you would apply a reward function to latent space reasoning tokens. [0]: https://www.lesswrong.com/posts/K8FxfK9GmJfiAhgcT/anthropic-...