2025 "q-learning" Papers
2 papers found
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
Yigit Korkmaz, Urvi Bhuwania, Ayush Jain et al.
NeurIPS 2025posterarXiv:2510.18828
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging
Sajad Khodadadian, Martin Zubeldia
NeurIPS 2025oralarXiv:2505.21796
2
citations