Tongzhou Mu
4 Papers
39 Total Citations
Papers (4)
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
ICLR 2025 (arXiv)
27 citations
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
ICLR 2024 (arXiv)
7 citations
When Should We Prefer State-to-Visual DAgger over Visual Reinforcement Learning?
AAAI 2025 (arXiv)
5 citations
Refactoring Policy for Compositional Generalizability using Self-Supervised Object Proposals
NeurIPS 2020 (arXiv)
0 citations