
Inference Engines
IntelliServer
simplifies the evaluation of LLMs by providing a unified microservice to access and test multiple AI models.
FlexLLMGen is a high-throughput generation engine for running large language models with limited GPU memory. It achieves high throughput through IO-efficient offloading, compression, and large effective batch sizes.
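The offloading idea can be illustrated in a toy form: keep weights in CPU memory, load them one layer at a time, and amortize each load over many micro-batches so every weight matrix crosses the CPU-to-GPU link only once per pass. All names below are illustrative stand-ins, not FlexLLMGen's actual API, and the real engine additionally compresses weights and the KV cache.

```python
# Toy sketch of IO-efficient offloading with a large effective batch.
# Arrays stand in for tensors; a copy stands in for a CPU->GPU transfer.
import numpy as np

def generate_offloaded(weights_on_cpu, micro_batches):
    """Run every micro-batch through a layer before loading the next
    layer, so each weight matrix is transferred once per pass instead
    of once per micro-batch (the core of IO-efficient offloading)."""
    io_transfers = 0
    for w in weights_on_cpu:
        w_gpu = w.copy()          # stand-in for one CPU->GPU weight transfer
        io_transfers += 1
        for i, x in enumerate(micro_batches):
            micro_batches[i] = np.tanh(x @ w_gpu)  # stand-in layer compute
    return micro_batches, io_transfers

rng = np.random.default_rng(0)
layers = [rng.standard_normal((8, 8)) for _ in range(4)]
batches = [rng.standard_normal((16, 8)) for _ in range(32)]  # 512 rows total

out, transfers = generate_offloaded(layers, batches)
print(transfers)  # 4 transfers serve the whole 512-row effective batch,
                  # versus 4 * 32 if each micro-batch reloaded the weights
```

The large effective batch is what makes the offloading cost tolerable: transfer time is paid per layer, not per sequence, so throughput grows with batch size even though per-token latency is high.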