hirako2000 7 days ago

Is there any source you could reference? Really interested.

It would not surprise me. Why would they build from scratch? Every LLM is a "fork" of GPT. Did they not come up with the mixture-of-experts idea, though?

bpavuk 7 days ago

and every LLM is a "fork" of Google's Transformer architecture.

everything is a "fork", if you give it serious thought.