Chongjie Zhang
19
Papers
8
Total Citations
Papers (19)
Learning to Plan Before Answering: Self-Teaching LLMs to Learn Abstract Plans for Problem Solving
ICLR 2025
4
citations
Enhancing Decision-Making of Large Language Models via Actor-Critic
ICML 2025arXiv
4
citations
Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners
ICML 2024
0
citations
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
ICML 2024arXiv
0
citations
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
NeurIPS 2020arXiv
0
citations
Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration
NeurIPS 2021arXiv
0
citations
Celebrating Diversity in Shared Multi-Agent Reinforcement Learning
NeurIPS 2021arXiv
0
citations
Model-Based Reinforcement Learning via Imagination with Derived Memory
NeurIPS 2021
0
citations
On the Estimation Bias in Double Q-Learning
NeurIPS 2021arXiv
0
citations
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization
NeurIPS 2021arXiv
0
citations
Offline Reinforcement Learning with Reverse Model-based Imagination
NeurIPS 2021arXiv
0
citations
Low-Rank Modular Reinforcement Learning via Muscle Synergy
NeurIPS 2022arXiv
0
citations
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
NeurIPS 2022arXiv
0
citations
Non-Linear Coordination Graphs
NeurIPS 2022arXiv
0
citations
CUP: Critic-Guided Policy Reuse
NeurIPS 2022arXiv
0
citations
Safe Opponent-Exploitation Subgame Refinement
NeurIPS 2022
0
citations
LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning
NeurIPS 2022
0
citations
Unsupervised Behavior Extraction via Random Intent Priors
NeurIPS 2023arXiv
0
citations
Conservative Offline Policy Adaptation in Multi-Agent Games
NeurIPS 2023
0
citations