Lightweight alternative to LangChain for composing LLMs.
Blazingly fast LLM inference.
A chat interface crafted with llama.cpp for running Alpaca models. No API keys, entirely self-hosted!
A distributed multi-model LLM serving system with web UI and OpenAI-compatible RESTful APIs.
A method designed to enhance the efficiency of Transformer models.
MII enables low-latency, high-throughput inference, similar to vLLM, and is powered by DeepSpeed.
Open Source LLM Engineering Platform 🪢 Tracing, Evaluations, Prompt Management and Playground.
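Several of the servers above expose OpenAI-compatible RESTful APIs, so any client that speaks the OpenAI chat-completions format can talk to them. A minimal sketch of building such a request with the standard library; the base URL, port, and model name are placeholder assumptions, not values from any specific project:

```python
# Sketch: constructing a request for an OpenAI-compatible chat endpoint.
# Base URL ("http://localhost:8000") and model name ("my-local-model")
# are assumptions -- substitute the values of your own deployment.
import json
import urllib.request


def build_chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build a POST request in the OpenAI chat-completions wire format."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_chat_request("http://localhost:8000", "my-local-model", "Hello!")
```

Sending the request (e.g. with `urllib.request.urlopen(req)`) returns a JSON body whose reply text lives under `choices[0].message.content` in the OpenAI schema.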