Leaderboard
Berkeley Function-Calling Leaderboard
evaluates LLMs' ability to call external functions and tools.
a large-scale question-answering benchmark focused on real-world financial data, integrating both tabular and textual information.