daveguy 20 hours ago
That's the size of the largest, most capable open source models. Specifically, the largest Llama 3.1 model has 405B parameters, and DeepSeek's largest model has 671B parameters.
mhitza 20 hours ago
Small corrections: Llama 3.1 is not an open source model; it is released under the Llama 3.1 license. Apparently neither is DeepSeek (https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/LIC...), though I had wrongly assumed it was. I never considered using it, so I hadn't checked the license before.