An open-source GPU cluster manager for running LLMs
Inference for text-embeddings in Python
Nvidia Framework for LLM Inference
A distributed multi-model LLM serving system with web UI and OpenAI-compatible RESTful APIs.
Framework to create ChatGPT like bots over your dataset.
LLM inference in C/C++.
First LLM Multi-agent framework.
Your email address will not be published. Required fields are marked *
Comment *
Name *
Email *
Website
Captcha: 12 + 11 = ?*
Save my name, email, and website in this browser for the next time I comment.
Inference for text-embeddings in Python