ICLR Poster "reinforcement learning" Papers
15 papers found
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Yulai Zhao, Masatoshi Uehara, Gabriele Scalia et al.
ICLR 2025 poster | arXiv:2406.12120 | 13 citations
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu, Shuozhe Li, Harshit Sikchi et al.
ICLR 2025 poster | arXiv:2504.13368 | 2 citations
Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics
Runzhe Wu, Ayush Sekhari, Akshay Krishnamurthy et al.
ICLR 2025 poster | arXiv:2406.11810 | 3 citations
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Nick Hansen, Jyothir S V, Vlad Sobal et al.
ICLR 2025 poster | arXiv:2405.18418 | 20 citations
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Cong Lu, Shengran Hu, Jeff Clune
ICLR 2025 poster | arXiv:2405.15143 | 26 citations
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael Matthews, Michael Beukman, Chris Lu et al.
ICLR 2025 poster | arXiv:2410.23208 | 20 citations
Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning
Seanie Lee, Minsu Kim, Lynn Cherif et al.
ICLR 2025 poster | arXiv:2405.18540 | 44 citations
MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility
Wayne Wu, Honglin He, Jack He et al.
ICLR 2025 poster | arXiv:2407.08725 | 11 citations
Policy Gradient with Kernel Quadrature
Tetsuro Morimura, Satoshi Hayakawa
ICLR 2025 poster | arXiv:2310.14768 | 1 citation
Reinforcement learning with combinatorial actions for coupled restless bandits
Lily Xu, Bryan Wilder, Elias Khalil et al.
ICLR 2025 poster | arXiv:2503.01919 | 5 citations
Safety Representations for Safer Policy Learning
Kaustubh Mani, Vincent Mai, Charlie Gauthier et al.
ICLR 2025 poster | arXiv:2502.20341 | 1 citation
Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Hung Le, Dung Nguyen, Kien Do et al.
ICLR 2025 poster | arXiv:2410.10132 | 6 citations
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Hoang Khoi Nguyen Do, Truc Nguyen, Malik Hassanaly et al.
ICLR 2025 poster | arXiv:2503.06413 | 2 citations
VTDexManip: A Dataset and Benchmark for Visual-tactile Pretraining and Dexterous Manipulation with Reinforcement Learning
Qingtao Liu, Yu Cui, Zhengnan Sun et al.
ICLR 2025 poster | 11 citations
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Haipeng Luo, Qingfeng Sun, Can Xu et al.
ICLR 2025 poster | arXiv:2308.09583 | 637 citations