
Inference Engines
FasterTransformer
NVIDIA Framework for LLM Inference(Transitioned to TensorRT-LLM)
NVIDIA Framework for LLM Inference(Transitioned to TensorRT-LLM)
Playground for devs to finetune & deploy LLMs