How good must their training pipelines be? Releasing publicly and at this rate has made them very efficient.

Finetuning takes little resources, the base model training is the slow and expensive part. Architecturally 3.5 models are identical to their 3.6 counterparts, that is why there is a consensus that those are probably finetunes and not re-trained from scratch, like you will se many people publish their own on huggingface.

▲

genxy 3 hours ago | parent [-]

Understood, but look at their larger cadence over the years and the breadth of models. They are clearly not all finetunes. Meta for all its billions, doesn't have anything comparable.

	▲	bachmeier 14 minutes ago \| parent \| next [-]
		> Meta for all its billions, doesn't have anything comparable. Maybe nothing released to the public. I don't know that all of their models are public. I think all they really care about is that they aren't relying on one or two cloud providers for a critical piece of their infrastructure.
	▲	Computer0 25 minutes ago \| parent \| prev [-]
		competent leadership goes a long way