Playground for devs to finetune & deploy LLMs
Formerly langchain-ChatGLM: a question-answering app built with LangChain on a local knowledge base and LLMs such as ChatGLM.
Building applications with LLMs through composability
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Comprehensive set of tools for working with local LLMs for various tasks.
Build your own conversational search engine in fewer than 500 lines of code, by LeptonAI.
NVIDIA framework for LLM inference (transitioned to TensorRT-LLM)