talkd.ai dialog | LLMWay – The Way To LLM

Inference Engines

talkd.ai dialog

Simple API for deploying any RAG or LLM that you want adding plugins.

GitHub

Simple API for deploying any RAG or LLM that you want adding plugins.

Wllama 930

WebAssembly binding for llama.cpp - Enabling in-browser LLM inference

Flash-Attention 20,422

A method designed to enhance the efficiency of Transformer models

magentic 2,378

Seamlessly integrate LLMs as Python functions

Swiss Army Llama 1,031

Comprehensive set of tools for working with local LLMs for various tasks.

SGLang 20,062

SGLang is a fast serving framework for large language models and vision language models.

Langfuse 18,086

Open Source LLM Engineering Platform 🪢 Tracing, Evaluations, Prompt Management, Evaluations and Playground.