"multi-agent systems" Papers

22 papers found

AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement

J Rosser, Jakob Foerster

NeurIPS 2025spotlightarXiv:2502.00757
4
citations

Agent-Oriented Planning in Multi-Agent Systems

Ao LI, Yuexiang Xie, Songze Li et al.

ICLR 2025posterarXiv:2410.02189
21
citations

Do as We Do, Not as You Think: the Conformity of Large Language Models

Zhiyuan Weng, Guikun Chen, Wenguan Wang

ICLR 2025posterarXiv:2501.13381
18
citations

DUET: Decentralized Bilevel Optimization without Lower-Level Strong Convexity

Zhen Qin, Zhuqing Liu, Songtao Lu et al.

ICLR 2025poster
1
citations

Graph Neural Networks Gone Hogwild

Olga Solodova, Nick Richardson, Deniz Oktay et al.

ICLR 2025posterarXiv:2407.00494
1
citations

Knowledge Starts with Practice: Knowledge-Aware Exercise Generative Recommendation with Adaptive Multi-Agent Cooperation

Yangtao Zhou, Hua Chu, chen et al.

NeurIPS 2025poster

KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems

Hancheng Ye, Zhengqi Gao, Mingyuan Ma et al.

NeurIPS 2025posterarXiv:2510.12872
1
citations

Many LLMs Are More Utilitarian Than One

Anita Keshmirian, Razan Baltaji, Babak Hemmatian et al.

NeurIPS 2025oralarXiv:2507.00814
2
citations

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Xuanming Zhang, Yuxuan Chen, Samuel (Min-Hsuan) Yeh et al.

NeurIPS 2025oralarXiv:2505.18943
6
citations

Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Observation Delays

Songchen Fu, Siang Chen, Shaojing Zhao et al.

NeurIPS 2025poster

Towards Principled Unsupervised Multi-Agent Reinforcement Learning

Riccardo Zamboni, Mirco Mutti, Marcello Restelli

NeurIPS 2025posterarXiv:2502.08365
2
citations

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Xiangming Gu, Xiaosen Zheng, Tianyu Pang et al.

ICML 2024poster

Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

Nikolai Karpov, Qin Zhang

AAAI 2024paperarXiv:2301.11442
2
citations

CompeteAI: Understanding the Competition Dynamics of Large Language Model-based Agents

Qinlin Zhao, Jindong Wang, Yixuan Zhang et al.

ICML 2024poster

Configurable Mirror Descent: Towards a Unification of Decision Making

Pengdeng Li, Shuxin Li, Chang Yang et al.

ICML 2024poster

EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading

Molei Qin, Shuo Sun, Wentao Zhang et al.

AAAI 2024paperarXiv:2309.12891
24
citations

Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

Tianyuan Jin, Hao-Lun Hsu, William Chang et al.

AAAI 2024paperarXiv:2312.15549

Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies

Alex DeWeese, Guannan Qu

ICML 2024poster

On Alternating-Time Temporal Logic, Hyperproperties, and Strategy Sharing

Raven Beutner, Bernd Finkbeiner

AAAI 2024paperarXiv:2312.12403
2
citations

Responsibility in Extensive Form Games

AAAI 2024paperarXiv:2312.07637

Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach

Bin Zhang, Hangyu Mao, Lijuan Li et al.

ICML 2024poster

Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints

Yuantong Li, Guang Cheng, Xiaowu Dai

ICML 2024poster