| ▲ | ainch 12 hours ago | |
I don't think it's a good comparison, given that Inception works on software while Cerebras and Groq work on hardware. If Inception demonstrates that diffusion LLMs work well at scale (at a reasonable price), then we can probably expect all the other frontier labs to copy them quickly, much as they did with OpenAI's reasoning models. | ||
| ▲ | refulgentis 12 hours ago | |
It definitely depends on what you're buying. Maybe some of the audience here was buying Groq and Cerebras chips? I don't think they sold them, but I can't say for sure. If you're a poor schmoe like me, you'd be thinking of them as API vendors of ~1000 tokens/s LLMs, especially because Inception v1 has been out for a while and we haven't seen a follow-the-leader effect. Coincidentally, that's one of my biggest questions: why not? | ||