"actor-critic methods" Papers

10 papers found

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control

Georgios Papoudakis, Thomas Coste, Jianye Hao et al.

NeurIPS 2025 · poster · arXiv:2509.01720

E(3)-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning

Dingyang Chen, Qi Zhang

ICML 2024 · poster · arXiv:2308.11842

Langevin Policy for Safe Reinforcement Learning

Fenghao Lei, Long Yang, Shiting Wen et al.

ICML 2024 · poster

Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing

Amutheezan Sivagnanam, Ava Pettet, Hunter Lee et al.

ICML 2024 · poster · arXiv:2405.13205

Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation

Yudan Wang, Yue Wang, Yi Zhou et al.

ICML 2024 · oral · arXiv:2406.01762

On the Second-Order Convergence of Biased Policy Gradient Algorithms

Siqiao Mu, Diego Klabjan

ICML 2024 · poster · arXiv:2311.02546

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Michal Nauman, Michał Bortkiewicz, Piotr Milos et al.

ICML 2024 · poster · arXiv:2403.00514

Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics

Luca Grillotti, Maxence Faldor, Borja G. León et al.

ICML 2024 · poster · arXiv:2403.09930

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

Tianying Ji, Yu Luo, Fuchun Sun et al.

ICML 2024 · poster · arXiv:2306.02865

Trust the Model Where It Trusts Itself - Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption

Bernd Frauenknecht, Artur Eisele, Devdutt Subhasish et al.

ICML 2024 · poster · arXiv:2405.19014