Shuyue Hu
Papers: 5
Total Citations: 41
Papers (5)
ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning
NeurIPS 2025, arXiv
36
citations
Scaling Physical Reasoning with the PHYSICS Dataset
NeurIPS 2025
5
citations
Configurable Mirror Descent: Towards a Unification of Decision Making
ICML 2024
0
citations
Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach
NeurIPS 2019
0
citations
The Best of Both Worlds in Network Population Games: Reaching Consensus and Convergence to Equilibrium
NeurIPS 2023
0
citations