Chongjie Zhang

19 Papers
8 Total Citations

Papers (19)

Learning to Plan Before Answering: Self-Teaching LLMs to Learn Abstract Plans for Problem Solving

ICLR 2025
4 citations

Enhancing Decision-Making of Large Language Models via Actor-Critic

ICML 2025 | arXiv
4 citations

Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners

ICML 2024
0 citations

Bayesian Design Principles for Offline-to-Online Reinforcement Learning

ICML 2024 | arXiv
0 citations

Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning

NeurIPS 2020 | arXiv
0 citations

Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration

NeurIPS 2021 | arXiv
0 citations

Celebrating Diversity in Shared Multi-Agent Reinforcement Learning

NeurIPS 2021 | arXiv
0 citations

Model-Based Reinforcement Learning via Imagination with Derived Memory

NeurIPS 2021
0 citations

On the Estimation Bias in Double Q-Learning

NeurIPS 2021 | arXiv
0 citations

Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization

NeurIPS 2021 | arXiv
0 citations

Offline Reinforcement Learning with Reverse Model-based Imagination

NeurIPS 2021 | arXiv
0 citations

Low-Rank Modular Reinforcement Learning via Muscle Synergy

NeurIPS 2022 | arXiv
0 citations

RORL: Robust Offline Reinforcement Learning via Conservative Smoothing

NeurIPS 2022 | arXiv
0 citations

Non-Linear Coordination Graphs

NeurIPS 2022 | arXiv
0 citations

CUP: Critic-Guided Policy Reuse

NeurIPS 2022 | arXiv
0 citations

Safe Opponent-Exploitation Subgame Refinement

NeurIPS 2022
0 citations

LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning

NeurIPS 2022
0 citations

Unsupervised Behavior Extraction via Random Intent Priors

NeurIPS 2023 | arXiv
0 citations

Conservative Offline Policy Adaptation in Multi-Agent Games

NeurIPS 2023
0 citations