NEURIPS 2025 "actor-critic methods" Papers
2 papers found
Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach
Swetha Ganesh, Vaneet Aggarwal
NEURIPS 2025posterarXiv:2505.19986
2
citations
Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
Georgios Papoudakis, Thomas Coste, Jianye Hao et al.
NEURIPS 2025posterarXiv:2509.01720