2025 "multi-agent learning" Papers
3 papers found
Learning from Delayed Feedback in Games via Extra Prediction
Yuma Fujimoto, Kenshi Abe, Kaito Ariu
NEURIPS 2025posterarXiv:2509.22426
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Yougang Lyu, Lingyong Yan, Zihan Wang et al.
ICLR 2025oralarXiv:2410.07672
Sparta Alignment: Collectively Aligning Multiple Language Models through Combat
Yuru Jiang, Wenxuan Ding, Shangbin Feng et al.
NEURIPS 2025posterarXiv:2506.04721
3
citations