WebAssembly binding for llama.cpp - Enabling in-browser LLM inference
NVIDIA framework for LLM inference (transitioned to TensorRT-LLM)
A suite of observability tools for evaluating, testing, and shipping LLM applications, and for calibrating language model outputs across the development and production lifecycle.
A chat interface built on llama.cpp for running Alpaca models. No API keys, entirely self-hosted!
FlexLLMGen is a high-throughput generation engine for running large language models with limited GPU memory. It achieves high throughput through IO-efficient offloading, compression, and large effective batch sizes.
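To make the offloading idea concrete, here is a minimal sketch (my own illustration, not FlexLLMGen's actual API; the model, layer count, and dimensions are invented): weights stay in CPU RAM, and each layer is streamed to the GPU just long enough to process one large batch, so the per-layer transfer cost is amortized across many sequences.

```python
import torch
import torch.nn as nn

device = "cuda" if torch.cuda.is_available() else "cpu"

hidden = 2048
# The "model": a stack of large linear layers whose weights live in CPU RAM.
layers = [nn.Linear(hidden, hidden) for _ in range(8)]

@torch.no_grad()
def forward_offloaded(x: torch.Tensor) -> torch.Tensor:
    """One forward pass where only a single layer's weights occupy the
    GPU at any time; the activations for the whole batch stay resident."""
    x = x.to(device)
    for layer in layers:
        layer.to(device)         # stream this layer's weights to the GPU
        x = torch.relu(layer(x))
        layer.to("cpu")          # evict weights; real systems overlap this
                                 # transfer with compute and compress weights
    return x

# A large batch pays each layer's transfer cost once for many sequences,
# which is the "large effective batch size" idea.
batch = torch.randn(512, hidden)
print(forward_offloaded(batch).shape)  # torch.Size([512, 2048])
```

On a CPU-only machine the `.to` calls are no-ops and the sketch still runs, which makes the trade-off easy to experiment with: a bigger batch raises throughput until activations themselves exhaust GPU memory.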
An open-source GPU cluster manager for running LLMs
NanoFlow is a throughput-oriented, high-performance serving framework for LLMs that consistently delivers higher throughput than vLLM, DeepSpeed-FastGen, and TensorRT-LLM.
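Throughput claims like this are typically stated in generated tokens per second under concurrent load. Below is a minimal probe of that metric (my own sketch, not NanoFlow's benchmark harness), assuming the server under test exposes an OpenAI-compatible /v1/completions endpoint, as vLLM and many serving frameworks do; the URL, model name, and prompts are placeholders.

```python
import time
from concurrent.futures import ThreadPoolExecutor

import requests

URL = "http://localhost:8000/v1/completions"  # assumed endpoint
MODEL = "my-model"                            # placeholder model name

def one_request(prompt: str) -> int:
    resp = requests.post(URL, json={
        "model": MODEL,
        "prompt": prompt,
        "max_tokens": 128,
    }, timeout=300).json()
    # OpenAI-compatible servers report token counts in the `usage` field.
    return resp["usage"]["completion_tokens"]

prompts = ["Summarize the history of computing."] * 64

start = time.time()
# Issue requests concurrently: serving throughput only shows up under load,
# since batching across in-flight requests is where these engines differ.
with ThreadPoolExecutor(max_workers=16) as pool:
    tokens = sum(pool.map(one_request, prompts))

print(f"{tokens / (time.time() - start):.1f} generated tokens/s")
```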