Inference for text-embeddings in Python
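The entry does not name the underlying library, so as a minimal illustration of text-embedding inference in Python, here is a sketch using sentence-transformers (an assumption; the original project may use a different stack):

```python
from sentence_transformers import SentenceTransformer

# Load a small, widely used embedding model (assumption: any
# sentence-transformers checkpoint works the same way).
model = SentenceTransformer("all-MiniLM-L6-v2")

# Encode a batch of texts into dense vectors (numpy array,
# shape: [num_texts, embedding_dim]).
embeddings = model.encode(["LLM inference is memory-bound.",
                           "Embeddings map text to vectors."])
print(embeddings.shape)
```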
An open-source GPU cluster manager for running LLMs
NVIDIA Framework for LLM Inference
NVIDIA Framework for LLM Inference (transitioned to TensorRT-LLM)
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
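This describes Ollama. As a minimal sketch of using it from Python, assuming a local Ollama server on its default port (11434) and a model already pulled with `ollama pull llama3`:

```python
import requests

# Call Ollama's local REST API; stream=False returns one JSON object
# instead of a stream of partial responses.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama3", "prompt": "Why is the sky blue?", "stream": False},
)
resp.raise_for_status()
print(resp.json()["response"])
```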
A method for improving the efficiency of Transformer models
FlexLLMGen is a high-throughput generation engine for running large language models with limited GPU memory. It achieves this through IO-efficient offloading, compression, and large effective batch sizes.
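To illustrate the offloading idea (this is not FlexLLMGen's actual API, just a toy sketch of the technique): keep layer weights in CPU RAM and stream each layer to the GPU only while it is computing, trading PCIe transfer time for GPU memory.

```python
import torch

# Toy "model": a stack of layers that lives in CPU RAM (requires CUDA).
layers = [torch.nn.Linear(4096, 4096) for _ in range(8)]

def forward_offloaded(x: torch.Tensor) -> torch.Tensor:
    for layer in layers:
        layer.to("cuda")           # stream this layer's weights onto the GPU
        x = layer(x.to("cuda"))    # compute on the GPU
        layer.to("cpu")            # evict to make room for the next layer
    return x

# A large effective batch amortizes the per-layer transfer cost, which is
# the core of a throughput-oriented offloading design like FlexLLMGen's.
out = forward_offloaded(torch.randn(64, 4096))
print(out.shape)
```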