Milestone Papers
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
(2019-10) ZeRO by Microsoft
(2019-10) ZeRO by Microsoft
(2022-01) LaMDA by Google