Remix.run Logo
zargon 3 days ago

GPT OSS 120B only has 5B active parameters. GP specifically said dense models, not MoE.