LLMWay – The Way To LLM LLMWay – The Way To LLM
  • LLM Trends
    • Leaderboard
  • LLM Models
  • LLM Learning
    • Milestone Papers
  • LLM Inference
    • Inference Engines
  • LLM Training
    • Training Frameworks
    • Evaluation
  • Home
  • Blog
  • Submit Sites
Oobabooga Benchmark
Leaderboard
Oobabooga Benchmark

Link

A benchmark for LLM

Relevant Sites

InfiBench

a benchmark designed to evaluate large language models (LLMs) specifically in their ability to answer real-world coding-related questions.

TAT-DQA

a large-scale Document Visual Question Answering (VQA) dataset designed for complex document understanding, particularly in financial reports.

SuperBench

a benchmark platform designed for evaluating large language models (LLMs) on a range of tasks, particularly focusing on their performance in different aspects such as natural language understanding, reasoning, and generalization.

AlpacaEval

An Automatic Evaluator for Instruction-following Language Models using Nous benchmark suite.

Chinese Large Model Leaderboard

an expert-driven benchmark for Chineses LLMs.

LiveBench

A Challenging, Contamination-Free LLM Benchmark.

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Copyright © 2025 LLMWay – The Way To LLM