Milestone Papers
Using Deep and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
(2022-01) Megatron-Turing NLG by Microsoft&NVIDIA
(2022-01) Megatron-Turing NLG by Microsoft&NVIDIA
(2025-1) DeepSeek-R1 by DeepSeek