2025 Poster "policy gradient methods" Papers
7 papers found
$\phi$-Update: A Class of Policy Update Methods with Policy Convergence Guarantee
Wenye Li, Jiacai Liu, Ke Wei
ICLR 2025 poster
3 citations
A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence
Mingyang Liu, Gabriele Farina, Asuman Ozdaglar
ICLR 2025 poster, arXiv:2408.00751
3 citations
Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits
Yuta Natsubori, Masataka Ushiku, Yuta Saito
ICLR 2025 poster
Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization
Sascha Marton, Tim Grams, Florian Vogt et al.
ICLR 2025 poster, arXiv:2408.08761
4 citations
On the Convergence of Projected Policy Gradient for Any Constant Step Sizes
Jiacai Liu, Wenye Li, Dachao Lin et al.
NeurIPS 2025 poster, arXiv:2311.01104
4 citations
Policy Gradient with Kernel Quadrature
Tetsuro Morimura, Satoshi Hayakawa
ICLR 2025 poster, arXiv:2310.14768
1 citation
REINFORCE Converges to Optimal Policies with Any Learning Rate
Samuel Robertson, Thang Chu, Bo Dai et al.
NeurIPS 2025 poster