WebAssembly binding for llama.cpp - Enabling in-browser LLM inference
Harness LLMs with Multi-Agent Programming
Inference for text embeddings in Rust, released under the HFOIL licence.
Gateway routes requests to 100+ open and closed source models through a unified API. It is production-ready, with support for caching, fallbacks, retries, timeouts, and load balancing, and can be edge-deployed for minimal latency.
A toolkit for deploying and serving Large Language Models (LLMs).
SGLang is a fast serving framework for large language models and vision language models.
Formerly langchain-ChatGLM: a question-answering app over a local knowledge base, built with langchain and LLMs such as ChatGLM.