
Milestone Papers
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
(2019-10) ZeRO by Microsoft
(2022-10) GLM-130B by Tsinghua