Chao Yu
7
Papers
18
Total Citations
Papers (7)
ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning
NeurIPS 2025
10
citations
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
AAAI 2025
8
citations
Conservative Offline Goal-Conditioned Implicit V-Learning
ICML 2025
0
citations
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
ICML 2024
0
citations
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
ICML 2024
0
citations
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
AAAI 2024
0
citations
Rapid Learning in Constrained Minimax Games with Negative Momentum
AAAI 2025
0
citations