lighteval | LLMWay – The Way To LLM

Evaluation

lighteval

A lightweight LLM evaluation suite that Hugging Face has been using internally.

simple-evals 4,195

Eval tools by OpenAI.

Giskard 4,997

Testing and evaluation library for LLM applications, in particular RAG pipelines.

MixEval 253

A reliable click-and-go evaluation suite compatible with both open-source and proprietary models, supporting MixEval and other benchmarks.

A unified platform from the LangChain framework for evaluating, collaborating on (with a human in the loop, HITL), logging, and monitoring LLM applications.

OLMO-eval 370

A repository for evaluating open language models.

Ragas 11,585

A framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines.