| ▲ | genxy 3 hours ago |
| How good must their training pipelines be? Releasing publicly and at this rate has made them very efficient. |
|
| ▲ | sleepyeldrazi 3 hours ago | parent [-] |
| Finetuning takes little resources, the base model training is the slow and expensive part. Architecturally 3.5 models are identical to their 3.6 counterparts, that is why there is a consensus that those are probably finetunes and not re-trained from scratch, like you will se many people publish their own on huggingface. |
| |
| ▲ | genxy 3 hours ago | parent [-] | | Understood, but look at their larger cadence over the years and the breadth of models. They are clearly not all finetunes. Meta for all its billions, doesn't have anything comparable. | | |
| ▲ | bachmeier 14 minutes ago | parent | next [-] | | > Meta for all its billions, doesn't have anything comparable. Maybe nothing released to the public. I don't know that all of their models are public. I think all they really care about is that they aren't relying on one or two cloud providers for a critical piece of their infrastructure. | |
| ▲ | Computer0 25 minutes ago | parent | prev [-] | | competent leadership goes a long way |
|
|