fragmede 5 days ago

The paper itself is fairly popular, with several thousand citations.

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

https://arxiv.org/abs/1701.06538