Inference Engines
talkd.ai dialog
Simple API for deploying any RAG or LLM that you want adding plugins.
Simple API for deploying any RAG or LLM that you want adding plugins.
WebAssembly binding for llama.cpp - Enabling in-browser LLM inference