Leaderboard
MMedBench
a benchmark that evaluates large language models' ability to answer medical questions across multiple languages.
a biomedical question-answering benchmark that tests whether models can answer research questions using PubMed abstracts.