miohtama 20 hours ago
I'm not an expert here, so I want to ask: what's magical about the 405B number?
daveguy 20 hours ago
That's the size of one of the largest, most capable open source models: Llama 3.1 has 405B parameters. DeepSeek's largest model has 671B parameters.