| ▲ | genxy 5 hours ago | |
Understood, but look at their release cadence over the years and the breadth of their models. They are clearly not all finetunes. Meta, for all its billions, doesn't have anything comparable. | ||
| ▲ | fgonzag an hour ago | parent | next [-] | |
In the Chinese AI scene, there seem to be two separate types of companies. Companies or labs like DeepSeek produce fewer but larger and more innovative models, so they seem to be more research-oriented. Then there are companies like z.ai (GLM), MiniMax, and Qwen, which focus more on commercializing AI and so produce far more versions, but with far fewer improvements between them (usually fine-tunes). Commercial providers like Anthropic probably do the same thing, maybe without even labeling it as a different version if the model is similar enough. | ||
| ▲ | bachmeier 3 hours ago | parent | prev | next [-] | |
> Meta, for all its billions, doesn't have anything comparable. Maybe nothing released to the public. I don't know that all of their models are public. I think all they really care about is that they aren't relying on one or two cloud providers for a critical piece of their infrastructure. | ||
| ▲ | Computer0 3 hours ago | parent | prev [-] | |
competent leadership goes a long way | ||