
Training Frameworks
GPT-NeoX
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
veRL
A flexible and efficient RL framework for LLMs.