(2025-1) DeepSeek-R1 by DeepSeek
(2024-12) Qwen2.5 by Alibaba
(2024-05) Llama3 by Meta
(2024-05) Mamba2 by CMU&Princeton
(2024-01) DeepSeek-v2 by DeepSeek
(2023-12) Mamba by CMU&Princeton
(2023-10) Mistral 7B by Mistral
(2023-07) LLaMA2 by Meta
(2023-05) ToT by Google&Princeton
(2023-05) DPO by Stanford
(2023-05) RWKV by Bo Peng
(2023-05) PaLM 2 by Google
(2023-05) Dromedary by CMU et al.
(2023-04) Pythia by EleutherAI et al.
(2023-04) LLaVA by UW–Madison&Microsoft
(2023-03) GPT 4 by OpenAI
(2023-03) PaLM-E by Google
(2023-03) LRU by DeepMind
(2023-02) Kosmos-1 by Microsoft
(2023-02) LLaMA by Meta
(2023-01) Flan 2022 Collection by Google
(2022-12) OPT-IML by Meta
(2022-11) Galactica by Meta
(2022-11) BLOOM by BigScience
(2022-11) HELM by Stanford
(2022-10) GLM-130B by Tsinghua
(2022-10) Flan-T5/PaLM by Google
(2022-09) Sparrow by DeepMind
(2022-06) METALM by Microsoft
(2022-06) BIG-bench by Google
(2022-06) Emergent Abilities by Google
(2022-05) UL2 by Google
(2022-05) OPT by Meta
(2022-04) Chinchilla by DeepMind
(2022-04) PaLM by Google
(2022-03) InstructGPT by OpenAI
(2022-01) Megatron-Turing NLG by Microsoft&NVIDIA
(2022-01) Minerva by Google
(2022-01) LaMDA by Google
(2022-01) COT by Google
(2021-12) Gopher by DeepMind
(2021-12) Retro by DeepMind
(2021-12) WebGPT by OpenAI
(2021-12) GLaM by Google
(2021-10) T0 by HuggingFace et al.
(2021-09) FLAN by Google
(2021-08) Foundation Models by Stanford
(2021-08) Codex by OpenAI
(2021-01) Switch Transformers by Google
(2020-05) GPT 3.0 by OpenAI
(2025-1) DeepSeek-R1 by DeepSeek