Efficient Training for Big Models
NVIDIA NeMo: a generative AI framework built for researchers and PyTorch developers working in the Large Language Model (LLM), Multimodal Model (MM), Automatic Speech Recognition (ASR), Text-to-Speech (TTS), and Computer Vision (CV) domains.
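As a quick taste of the API, here is a minimal sketch of pulling a pretrained NeMo ASR model from the model hub; the checkpoint name and audio path are illustrative assumptions, not part of the original list.

```python
# Minimal sketch: loading a pretrained NeMo ASR model.
# The checkpoint name and audio path are illustrative assumptions;
# see ASRModel.list_available_models() for the real options.
import nemo.collections.asr as nemo_asr

asr_model = nemo_asr.models.ASRModel.from_pretrained("stt_en_conformer_ctc_small")
transcripts = asr_model.transcribe(["sample.wav"])  # list of audio file paths
print(transcripts[0])
```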
torchtune: a native-PyTorch library for LLM fine-tuning.
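torchtune exposes plain-PyTorch model builders plus YAML-driven fine-tuning recipes. Below is a minimal sketch of the builder side, assuming the llama2_7b builder available in recent releases; the meta device is used so no real memory is allocated.

```python
# Minimal sketch: instantiating a torchtune model builder.
# The meta device avoids allocating real weights; in practice a
# fine-tuning recipe loads pretrained weights via a checkpointer.
import torch
from torchtune.models.llama2 import llama2_7b

with torch.device("meta"):
    model = llama2_7b()  # builds the architecture only, no weights

print(sum(p.numel() for p in model.parameters()))  # roughly 7B parameters
```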
GPT-NeoX (from EleutherAI): an implementation of model-parallel autoregressive transformers on GPUs, built on the DeepSpeed library.
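Since GPT-NeoX builds on DeepSpeed, its training loop follows DeepSpeed's engine pattern. The sketch below shows that generic pattern, not GPT-NeoX's actual entry point; the model, config values, and sizes are placeholders.

```python
# Illustrative sketch of the underlying DeepSpeed training pattern
# (not GPT-NeoX's own entry point). Launch with: deepspeed this_script.py
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)  # stand-in for a transformer
ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 1},  # ZeRO stage 1 optimizer sharding
}

# deepspeed.initialize wraps the model in an engine that handles
# distributed setup, optimizer partitioning, and mixed precision.
engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

x = torch.randn(8, 1024, device=engine.device)
loss = engine(x).pow(2).mean()
engine.backward(loss)  # engine.backward/step replace loss.backward/opt.step
engine.step()
```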
Megatron-DeepSpeed: the DeepSpeed version of NVIDIA's Megatron-LM, adding support for features such as MoE model training, curriculum learning, and 3D parallelism.
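The "3D" refers to combining tensor, pipeline, and data parallelism; the degrees of the three axes multiply to give the total GPU count. A small arithmetic sketch with illustrative numbers:

```python
# Illustrative arithmetic for 3D parallelism (values are hypothetical).
# The product of the three parallel degrees must equal the world size.
tensor_parallel = 8    # shards each layer's matmuls across 8 GPUs
pipeline_parallel = 4  # splits the layer stack into 4 sequential stages
data_parallel = 2      # replicates the whole pipeline twice over the data

world_size = tensor_parallel * pipeline_parallel * data_parallel
assert world_size == 64  # e.g., 8 nodes x 8 GPUs each
```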
Transformer Engine: a library for accelerating Transformer model training on NVIDIA GPUs.
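A minimal sketch of Transformer Engine's PyTorch API, assuming a recent release and an FP8-capable GPU (Hopper or newer); the layer and batch sizes are arbitrary.

```python
# Minimal sketch of Transformer Engine's PyTorch API (sizes arbitrary).
# FP8 execution requires an FP8-capable GPU (e.g., Hopper/H100).
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

layer = te.Linear(1024, 1024, bias=True).cuda()  # drop-in nn.Linear replacement
fp8_recipe = recipe.DelayedScaling(fp8_format=recipe.Format.HYBRID)

x = torch.randn(16, 1024, device="cuda")
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    y = layer(x)  # GEMMs run in FP8 with delayed scaling
y.sum().backward()
```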
Megatron-LM: NVIDIA's ongoing research on training transformer models at scale.