
Milestone Papers
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
(2019-09) Megatron-LM by NVIDIA
(2021-12) Retro by DeepMind