
Milestone Papers
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
(2021-01) Switch Transformers by Google
(2021-01) Switch Transformers by Google
(2020-01) Scaling Law by OpenAI