maxtext | LLMWay – The Way To LLM

Training Frameworks

maxtext

A simple, performant, and scalable JAX LLM!

Relevant Sites

GPT-NeoX 7,309

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Megatron-LM 13,662

Ongoing research training transformer models at scale.

BMTrain 609

Efficient Training for Big Models.

Mesh TensorFlow 1,615

Mesh TensorFlow: Model Parallelism Made Easier.

NeMo Framework 15,734

Generative AI framework built for researchers and PyTorch developers working on Large Language Models (LLMs), Multimodal Models (MMs), Automatic Speech Recognition (ASR), Text to Speech (TTS), and Computer Vision (CV) domains.

torchtune 5,504

A Native-PyTorch Library for LLM Fine-tuning.
