torchtune | LLMWay – The Way To LLM

Training Frameworks

torchtune

A Native-PyTorch Library for LLM Fine-tuning.

GitHub

A Native-PyTorch Library for LLM Fine-tuning.

Relevant Sites

Transformer Engine 2,733

A library for accelerating Transformer model training on NVIDIA GPUs.

Colossal-AI 41,163

Making large AI models cheaper, faster, and more accessible.

DeepSpeed 40,162

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Megatron-LM 13,662

Ongoing research training transformer models at scale.

NeMo Framework 15,734

Generative AI framework built for researchers and PyTorch developers working on Large Language Models (LLMs), Multimodal Models (MMs), Automatic Speech Recognition (ASR), Text to Speech (TTS), and Computer Vision (CV) domains.

Megatron-DeepSpeed 2,162

DeepSpeed version of NVIDIA's Megatron-LM that adds additional support for several features such as MoE model training, Curriculum Learning, 3D Parallelism, and others.

Relevant Sites

Leave a Reply Cancel reply