Leaderboard
BeHonest
A pioneering benchmark specifically designed to comprehensively assess honesty in LLMs.
A Swedish language understanding benchmark that evaluates natural language processing (NLP) models on tasks such as argumentation analysis, semantic similarity, and textual entailment.