LLM inference in C/C++.
SGLang is a fast serving framework for large language models and vision language models.
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
Inference for text embeddings in Python.
An open-source NLP framework that lets you use LLMs and transformer-based models from Hugging Face, OpenAI, and Cohere to interact with your own data.
Fine-tune, serve, deploy, and monitor any open-source LLMs in production. Used in production at BentoML for LLM-based applications.
FlexLLMGen is a high-throughput generation engine for running large language models with limited GPU memory. FlexLLMGen allows high-throughput generation by IO-efficient offloading, compression, and large effective batch sizes.
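The offloading idea behind FlexLLMGen can be sketched in a few lines: keep all layer weights in host (CPU) memory, swap only one layer at a time into the limited device buffer, and push a large batch through that layer before the next transfer, so each weight load is amortized over many sequences. The sketch below is a toy illustration of that pattern in plain Python, not FlexLLMGen's actual API; all names (`cpu_weights`, `forward`, `gpu_buffer`) are hypothetical.

```python
import random

random.seed(0)
n_layers, d = 4, 8

def rand_matrix(rows, cols):
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

# All layer weights live in host memory (the "offloaded" store).
cpu_weights = [rand_matrix(d, d) for _ in range(n_layers)]

def matmul_relu(x, w):
    # One toy layer: matrix multiply followed by ReLU.
    return [[max(sum(row[k] * w[k][j] for k in range(len(w))), 0.0)
             for j in range(len(w[0]))] for row in x]

def forward(batch):
    h = batch
    for layer in range(n_layers):
        gpu_buffer = cpu_weights[layer]  # "load" one layer into limited device memory
        h = matmul_relu(h, gpu_buffer)   # run the entire batch through this layer
        del gpu_buffer                   # evict before swapping in the next layer
    return h

# A large effective batch amortizes the cost of each weight transfer.
batch = rand_matrix(32, d)
out = forward(batch)
print(len(out), len(out[0]))  # batch size and hidden size are preserved
```

In the real engine the transfers overlap with compute and weights/KV-cache can also be compressed, but the scheduling principle is the same: trade transfer latency for throughput via large effective batches.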