Leaderboard
DreamBench++
a benchmark for evaluating the performance of large language models (LLMs) in various tasks related to both textual and visual imagination.
a benchmark for evaluating the performance of large language models (LLMs) in various tasks related to both textual and visual imagination.
a multimodal question-answering benchmark designed to evaluate AI models' cognitive ability to understand human beliefs and goals.