
Milestone Papers
An empirical analysis of compute-optimal large language model training
(2022-04) Chinchilla by DeepMind
(2019-09) Megatron-LM by NVIDIA