
Milestone Papers
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
(2022-06) BIG-bench by Google
(2022-06) BIG-bench by Google
(2024-12) Qwen2.5 by Alibaba