Oral "temporal difference learning" Papers
11 papers found
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging
Sajad Khodadadian, Martin Zubeldia
NeurIPS 2025oralarXiv:2505.21796
2
citations
Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage
Ying-yee Ava Lau, Zhiwen Shao, Dit-Yan Yeung
ICLR 2025oral
8
citations
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
GUOJUN XIONG, Shufan Wang, Daniel Jiang et al.
ICLR 2025oralarXiv:2411.15014
3
citations
Physics-informed Temporal Difference Metric Learning for Robot Motion Planning
Ruiqi Ni, zherong pan, Ahmed Hussain Qureshi
ICLR 2025oralarXiv:2505.05691
2
citations
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng, Ruixi Qiao, ma yingwei et al.
ICLR 2025oralarXiv:2410.00564
7
citations
Temporal Difference Learning: Why It Can Be Fast and How It Will Be Faster
Patrick Schnell, Luca Guastoni, Nils Thuerey
ICLR 2025oral
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
Jiuqi Wang, Ethan Blaser, Hadi Daneshmand et al.
ICLR 2025oralarXiv:2405.13861
14
citations
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke, Zaiwen Wen, Junyu Zhang
ICML 2024oral
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou, Andrea Zanette, Jiayi Pan et al.
ICML 2024oral
Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation
Yudan Wang, Yue Wang, Yi Zhou et al.
ICML 2024oral
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee, Ming Jin, Javad Lavaei et al.
ICML 2024oral