Milestone Papers
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
(2019-09) Megatron-LM by NVIDIA
(2019-09) Megatron-LM by NVIDIA
(2022-06) Emergent Abilities by Google