GPT-NeoX
Training Frameworks
GPT-NeoX

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Relevant Sites

Leave a Reply

Your email address will not be published. Required fields are marked *