
Milestone Papers
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
(2021-12) GLaM by Google
(2021-12) GLaM by Google
(2023-12) Mamba by CMU&Princeton