torchtitan: a native PyTorch library for large model training.
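To give a flavor of what native-PyTorch large-model training looks like, here is a minimal sketch using PyTorch's built-in FSDP (FullyShardedDataParallel), the kind of sharded data parallelism such a library builds on. The toy model, hyperparameters, and launch setup are illustrative assumptions, not torchtitan's actual training loop.

```python
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

# Assumes launch via `torchrun --nproc_per_node=<n_gpus> script.py`,
# which sets the env vars that init_process_group reads.
dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

# Toy stand-in for a transformer stack (an assumption for illustration).
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.GELU(),
    torch.nn.Linear(4096, 1024),
).cuda()

# FSDP shards parameters, gradients, and optimizer state across ranks,
# gathering full parameters only around each forward/backward pass.
model = FSDP(model)

opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
x = torch.randn(8, 1024, device="cuda")
loss = model(x).pow(2).mean()  # dummy objective for illustration
loss.backward()
opt.step()
```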
Megatron-DeepSpeed: the DeepSpeed version of NVIDIA's Megatron-LM, adding support for features such as MoE model training, curriculum learning, and 3D parallelism.
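Megatron-DeepSpeed is driven by its own training scripts, but the DeepSpeed engine underneath follows a pattern like the sketch below. The toy model, batch size, and ZeRO stage are illustrative assumptions, not the library's actual setup.

```python
import torch
import deepspeed

# Toy stand-in for a transformer; typically launched with the
# `deepspeed` launcher, which sets up the distributed environment.
model = torch.nn.Linear(1024, 1024)

ds_config = {
    "train_micro_batch_size_per_gpu": 4,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 2},  # shard optimizer state + gradients
}

engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

batch = torch.randn(4, 1024).to(engine.device)
loss = engine(batch).pow(2).mean()  # dummy loss for illustration
engine.backward(loss)  # engine handles gradient sync / ZeRO partitioning
engine.step()
```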
MaxText: a simple, performant, and scalable JAX LLM.
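For a taste of the SPMD style such JAX libraries build on, here is a minimal data-parallel training step using jax.pmap. The toy linear model is an assumption for illustration, not MaxText's actual code (which uses jit with sharding annotations).

```python
import jax
import jax.numpy as jnp

def loss_fn(w, x, y):
    return jnp.mean((x @ w - y) ** 2)

def step_fn(w, x, y):
    grads = jax.grad(loss_fn)(w, x, y)
    # Average gradients across devices so replicas stay in sync.
    grads = jax.lax.pmean(grads, axis_name="batch")
    return w - 1e-2 * grads

# Replicate the step across all local devices (data parallelism).
p_step = jax.pmap(step_fn, axis_name="batch")

n = jax.local_device_count()
w = jnp.zeros((16, 4))
ws = jax.device_put_replicated(w, jax.local_devices())  # same params everywhere
xs = jnp.ones((n, 8, 16))  # one batch shard per device
ys = jnp.ones((n, 8, 4))
ws = p_step(ws, xs, ys)
```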
Megatron-LM: ongoing research on training transformer models at scale.
torchtune: a native PyTorch library for LLM fine-tuning.
NeMo: a generative AI framework built for researchers and PyTorch developers working on large language models (LLMs), multimodal models (MMs), automatic speech recognition (ASR), text-to-speech (TTS), and computer vision (CV).
Mesh TensorFlow: Model Parallelism Made Easier.