Mesh TensorFlow: Model Parallelism Made Easier.
A DeepSpeed version of NVIDIA's Megatron-LM that adds support for features such as MoE model training, curriculum learning, 3D parallelism, and more.
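For context, DeepSpeed training runs are driven by a JSON configuration file. A minimal sketch is shown below; the keys (`train_batch_size`, `fp16`, `zero_optimization`) are standard DeepSpeed config options, but the values here are illustrative placeholders, not settings taken from the Megatron-DeepSpeed repository:

```json
{
  "train_batch_size": 256,
  "gradient_accumulation_steps": 1,
  "fp16": {
    "enabled": true
  },
  "zero_optimization": {
    "stage": 1
  }
}
```

In Megatron-DeepSpeed, model-parallel settings such as tensor and pipeline parallel degrees are typically passed as launcher command-line arguments alongside a config like this.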
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
A generative AI framework built for researchers and PyTorch developers working on Large Language Models (LLMs), Multimodal Models (MMs), Automatic Speech Recognition (ASR), Text-to-Speech (TTS), and Computer Vision (CV).
A native-PyTorch library for LLM fine-tuning.
Efficient Training for Big Models.
veRL is a flexible and efficient RL framework for LLMs.