Building applications with LLMs through composability
FlexLLMGen is a high-throughput generation engine for running large language models with limited GPU memory. FlexLLMGen achieves high throughput through IO-efficient offloading, compression, and large effective batch sizes.
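The core trick behind offloading engines like this is to amortize slow weight transfers over many inputs: load a layer's weights once, apply them to every micro-batch of a large effective batch, then move on to the next layer. The sketch below is purely illustrative (toy scalar "weights", not FlexLLMGen's actual API; all names are hypothetical):

```python
# Illustrative sketch of layer-wise offloading, NOT FlexLLMGen's real API.
# Weights live on slow storage and are "loaded" once per layer; each load
# is then reused across every micro-batch, so the IO cost per token shrinks
# as the effective batch grows.

def offloaded_forward(layer_weights, micro_batches):
    """Apply each layer to all micro-batches before loading the next layer."""
    for weight in layer_weights:          # one slow "load" per layer
        # reuse the loaded weight across the whole effective batch
        micro_batches = [[x * weight for x in mb] for mb in micro_batches]
    return micro_batches

layers = [2, 3]                  # toy per-layer "weights" on slow storage
batches = [[1, 2], [3, 4]]       # large effective batch, split into micro-batches
print(offloaded_forward(layers, batches))  # [[6, 12], [18, 24]]
```

With real models the per-layer load is a GPU transfer of tensors rather than a scalar, but the scheduling idea (layer-major, not batch-major) is the same.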
Build your own conversational search engine in under 500 lines of code, by LeptonAI.
Blazingly fast LLM inference.
Framework to create ChatGPT-like bots over your dataset.
AI gateway and marketplace for developers that enables streamlined integration of AI features into products.
A toolkit for deploying and serving Large Language Models (LLMs).