Leaderboard
DreamBench++
A benchmark for evaluating large language models (LLMs) on tasks involving both textual and visual imagination.
An Automatic Evaluator for Instruction-following Language Models, using the Nous benchmark suite.