2025 Poster "policy gradient theorem" Papers
2 papers found
Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach
Swetha Ganesh, Vaneet Aggarwal
NeurIPS 2025, poster, arXiv:2505.19986
2 citations
Scalable Discrete Diffusion Samplers: Combinatorial Optimization and Statistical Physics
Sebastian Sanokowski, Wilhelm Berghammer, Haoyu Wang et al.
ICLR 2025, poster, arXiv:2502.08696
14 citations