2024 Oral "policy gradient methods" Papers
2 papers found
Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality beyond Lazy Learning
Kakei Yamamoto, Kazusato Oko, Zhuoran Yang et al.
ICML 2024oral
Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation
Yudan Wang, Yue Wang, Yi Zhou et al.
ICML 2024oral