ICLR 2025 "function approximation" Papers
5 papers found
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang, Min-hwan Oh
ICLR 2025, poster, arXiv:2503.05306
3 citations
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Alexander Levine, Peter Stone, Amy Zhang
ICLR 2025, poster, arXiv:2410.03016
1 citation
Revisiting a Design Choice in Gradient Temporal Difference Learning
Xiaochi Qian, Shangtong Zhang
ICLR 2025, oral, arXiv:2308.01170
6 citations
Second Order Bounds for Contextual Bandits with Function Approximation
Aldo Pacchiano
ICLR 2025, poster, arXiv:2409.16197
7 citations
Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation
Chenyu Zhang, Xu Chen, Xuan Di
ICLR 2025, poster, arXiv:2408.08192
7 citations