Inference Engines
MNN-LLM
A Device-Inference framework, including LLM Inference on device(Mobile Phone/PC/IOT)
A Device-Inference framework, including LLM Inference on device(Mobile Phone/PC/IOT)
Nvidia Framework for LLM Inference