Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Milestone Papers
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity

(2021-01) Switch Transformers by Google

(2021-01) Switch Transformers by Google

Relevant Sites

Leave a Reply

Your email address will not be published. Required fields are marked *