
Inference Engines
AI Gateway
Gateway streamlines requests to 100+ open & closed source models with a unified API. It is also production-ready with support for caching, fallbacks, retries, timeouts, and load balancing, and can be edge-deployed for minimal latency.
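A minimal sketch of what calling through a unified gateway API can look like, assuming an OpenAI-compatible endpoint exposed by the gateway; the local address, port, API key, and model name below are placeholders and not values taken from this document.

```python
# Minimal sketch: route a chat request through a gateway's unified API.
# The base_url, key, and model name are assumptions for illustration only.
from openai import OpenAI

# Point the standard OpenAI client at the gateway instead of a provider.
client = OpenAI(
    base_url="http://localhost:8787/v1",  # hypothetical gateway address
    api_key="YOUR_GATEWAY_KEY",           # placeholder credential
)

# The call shape stays the same regardless of which upstream model the
# gateway routes to; caching, retries, and fallbacks happen behind it.
response = client.chat.completions.create(
    model="gpt-4o-mini",  # example model name; substitute any routed model
    messages=[{"role": "user", "content": "Explain load balancing in one line."}],
)
print(response.choices[0].message.content)
```

Because the client only ever talks to the gateway, swapping or mixing upstream providers is a routing change on the gateway side rather than a code change in the application.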
FlexLLMGen is a high-throughput generation engine for running large language models with limited GPU memory. It achieves this through IO-efficient offloading, compression, and large effective batch sizes.
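The offloading idea can be sketched in a few lines: keep the weights in CPU memory and stream them onto the GPU one layer at a time while a large batch is pushed through, so the cost of transferring each layer is amortized over many sequences. This is a conceptual illustration only, not FlexLLMGen's actual API; the toy linear layers stand in for transformer blocks.

```python
# Conceptual sketch of IO-efficient weight offloading (not FlexLLMGen's API):
# weights live in CPU RAM and are copied to the GPU one layer at a time,
# letting a large batch run through a model that does not fit on-device.
import torch
import torch.nn as nn


def offloaded_forward(layers: nn.ModuleList, hidden: torch.Tensor,
                      device: str = "cuda") -> torch.Tensor:
    """Run CPU-resident layers over a large batch, one layer on the GPU at a time."""
    hidden = hidden.to(device)
    for layer in layers:
        layer.to(device)        # stream this layer's weights onto the GPU
        hidden = layer(hidden)  # compute over the whole (large) batch
        layer.to("cpu")         # evict the weights so the next layer fits
    return hidden.cpu()


if __name__ == "__main__":
    # Toy stand-in for transformer blocks.
    layers = nn.ModuleList(nn.Linear(1024, 1024) for _ in range(4))
    batch = torch.randn(512, 1024)  # large effective batch amortizes transfer cost
    if torch.cuda.is_available():
        print(offloaded_forward(layers, batch).shape)
```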