Leaderboard
LLMEval
focuses on understanding how these models perform in various scenarios and analyzing results from an interpretability perspective.
a large-scale question-answering benchmark focused on real-world financial data, integrating both tabular and textual information.