
Milestone Papers
Improving alignment of dialogue agents via targeted human judgements
(2022-09) Sparrow by DeepMind
(2024-12) Qwen2.5 by Alibaba