| ▲ | ghshephard 3 hours ago | ||||||||||||||||||||||
Do any of the open weight models from smaller labs exist if they can't distill from the SoTA models that are throwing billions of dollars of compute into pretraining? | |||||||||||||||||||||||
| ▲ | daniel_iversen 2 hours ago | parent | next [-] | ||||||||||||||||||||||
I’ve been wondering the same. And I think pretty much all the impressive small lab models were guilty of it, right? At least there is still larger players like DeepSeek and mistral to provide a bit of diversity in the market | |||||||||||||||||||||||
| ▲ | username223 2 hours ago | parent | prev [-] | ||||||||||||||||||||||
Does it matter? The frontier models stole the whole internet, then the second-level models stole from them… It’s all theft. | |||||||||||||||||||||||
| |||||||||||||||||||||||