WebAssembly binding for llama.cpp - Enabling in-browser LLM inference
A Python package for text-to-SQL with self-hosting functionality and RESTful APIs, compatible with both proprietary and open-source LLMs.
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
A method designed to enhance the efficiency of Transformer models
Confidently evaluate, test, and ship LLM applications with a suite of observability tools to calibrate language model outputs across your dev and production lifecycle.
An AI gateway and marketplace for developers that streamlines the integration of AI features into products.
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.