"q-learning methods" Papers
2 papers found
ShiQ: Bringing back Bellman to LLMs
Pierre Clavier, Nathan Grinsztajn, Raphaël Avalos et al.
NeurIPS 2025posterarXiv:2505.11081
1
citations
Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning
Yanwen Ba, Xuan Liu, Xinning Chen et al.
AAAI 2024paperarXiv:2312.12095
5
citations