ICML "continuous control tasks" Papers
10 papers found
Absolute Policy Optimization: Enhancing Lower Probability Bound of Performance with High Confidence
Weiye Zhao, Feihan Li, Yifan Sun et al.
ICML 2024poster
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji, Yongyuan Liang, Yan Zeng et al.
ICML 2024poster
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy, Christoph Dann, Rahul Kidambi et al.
ICML 2024posterarXiv:2401.04056
Diffusion Model-Augmented Behavioral Cloning
Shang-Fu Chen, Hsiang-Chun Wang, Ming-Hao Hsu et al.
ICML 2024oralarXiv:2302.13335
EvIL: Evolution Strategies for Generalisable Imitation Learning
Silvia Sapora, Gokul Swamy, Christopher Lu et al.
ICML 2024poster
Hybrid Inverse Reinforcement Learning
Juntao Ren, Gokul Swamy, Steven Wu et al.
ICML 2024oralarXiv:2402.08848
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang et al.
ICML 2024oral
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
Luca Grillotti, Maxence Faldor, Borja G. León et al.
ICML 2024posterarXiv:2403.09930
Reward Shaping for Reinforcement Learning with An Assistant Reward Agent
Haozhe Ma, Kuankuan Sima, Thanh Vinh Vo et al.
ICML 2024poster
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji, Yu Luo, Fuchun Sun et al.
ICML 2024poster