
Inference Engines
FastChat
A distributed multi-model LLM serving system with web UI and OpenAI-compatible RESTful APIs.
A distributed multi-model LLM serving system with web UI and OpenAI-compatible RESTful APIs.
Create, deploy and operate Actions using Python anywhere to enhance your AI agents and assistants. Batteries included with an extensive set of libraries, helpers and logging.