Blazingly fast LLM inference.
Inference for text embeddings in Rust, HFOIL licence.
Use ChatGPT on WeChat via wechaty.
NVIDIA framework for LLM inference (transitioned to TensorRT-LLM).
A high-throughput and memory-efficient inference and serving engine for LLMs.
Lightweight alternative to LangChain for composing LLMs.
An on-device inference framework, including LLM inference on device (mobile phone/PC/IoT).