Milestone Papers
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
(2019-10) ZeRO by Microsoft
(2019-10) ZeRO by Microsoft
(2022-09) Sparrow by DeepMind