
Milestone Papers
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
(2019-10) ZeRO by Microsoft
(2019-10) ZeRO by Microsoft
(2020-01) Scaling Law by OpenAI