Leaderboard
MMedBench
a benchmark that evaluates large language models' ability to answer medical questions across multiple languages.
a benchmark that evaluates large language models' ability to answer medical questions across multiple languages.
a benchmark platform for large language models (LLMs) that features anonymous, randomized battles in a crowdsourced manner.