"continuous control tasks" Papers
13 papers found
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang, Min-hwan Oh
ICLR 2025posterarXiv:2503.05306
3
citations
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti, Carl Ek, Amanda Prorok
ICLR 2025posterarXiv:2410.04988
3
citations
Risk-Sensitive Variational Actor-Critic: A Model-Based Approach
Alonso Granados, Mohammadreza Ebrahimi, Jason Pacheco
ICLR 2025poster
1
citations
Absolute Policy Optimization: Enhancing Lower Probability Bound of Performance with High Confidence
Weiye Zhao, Feihan Li, Yifan Sun et al.
ICML 2024poster
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji, Yongyuan Liang, Yan Zeng et al.
ICML 2024poster
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy, Christoph Dann, Rahul Kidambi et al.
ICML 2024poster
Diffusion Model-Augmented Behavioral Cloning
Shang-Fu Chen, Hsiang-Chun Wang, Ming-Hao Hsu et al.
ICML 2024oral
EvIL: Evolution Strategies for Generalisable Imitation Learning
Silvia Sapora, Gokul Swamy, Christopher Lu et al.
ICML 2024poster
Hybrid Inverse Reinforcement Learning
Juntao Ren, Gokul Swamy, Steven Wu et al.
ICML 2024oral
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang et al.
ICML 2024oral
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
Luca Grillotti, Maxence Faldor, Borja G. León et al.
ICML 2024poster
Reward Shaping for Reinforcement Learning with An Assistant Reward Agent
Haozhe Ma, Kuankuan Sima, Thanh Vinh Vo et al.
ICML 2024poster
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji, Yu Luo, Fuchun Sun et al.
ICML 2024poster