Finetuning takes little resources, the base model training is the slow and expensive part. Architecturally 3.5 models are identical to their 3.6 counterparts, that is why there is a consensus that those are probably finetunes and not re-trained from scratch, like you will se many people publish their own on huggingface.

▲

genxy 3 hours ago | parent [-]

Understood, but look at their larger cadence over the years and the breadth of models. They are clearly not all finetunes. Meta for all its billions, doesn't have anything comparable.

	▲	bachmeier 14 minutes ago \| parent \| next [-]
		> Meta for all its billions, doesn't have anything comparable. Maybe nothing released to the public. I don't know that all of their models are public. I think all they really care about is that they aren't relying on one or two cloud providers for a critical piece of their infrastructure.
	▲	Computer0 25 minutes ago \| parent \| prev [-]
		competent leadership goes a long way