
Milestone Papers
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
(2019-09) Megatron-LM by NVIDIA
(2019-09) Megatron-LM by NVIDIA
(2018-10) BERT by Google