LLM inference in C/C++.
SGLang is a fast serving framework for large language models and vision language models.
Fine-tune, serve, deploy, and monitor any open-source LLMs in production. Used in production at BentoML for LLMs-based applications.
Harness LLMs with Multi-Agent Programming
Formerly langchain-ChatGLM, local knowledge based LLM (like ChatGLM) QA app with langchain.
Framework to create ChatGPT like bots over your dataset.
Build your own conversational search engine using less than 500 lines of code by LeptonAI.
Your email address will not be published. Required fields are marked *
Comment *
Name *
Email *
Website
Captcha: 18 - 16 = ?*
Save my name, email, and website in this browser for the next time I comment.
SGLang is a fast serving framework for large language models and vision language models.