
Milestone Papers
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
(2019-09) Megatron-LM by NVIDIA
(2023-04) Pythia by EleutherAI et al.