
Inference Engines
FasterTransformer
NVIDIA Framework for LLM Inference(Transitioned to TensorRT-LLM)
NVIDIA Framework for LLM Inference(Transitioned to TensorRT-LLM)
Use ChatGPT On Wechat via wechaty