Remix.run Logo
ramshanker 7 hours ago

Do we get any model architecture details like parameter size etc.? Few months back, we used to talk more on this, now it's mostly about model capabilities.

Davidzheng 7 hours ago | parent [-]

I'm honestly not sure what you mean? The frontier labs have kept arch as secrets since gpt3.5

willis936 5 hours ago | parent [-]

At the very least gemini 3's flyer claims 1T parameters.