"temporal difference learning" Papers

12 papers found

A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging

Sajad Khodadadian, Martin Zubeldia

NeurIPS 2025, oral, arXiv:2505.21796
2 citations

Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage

Ying-yee Ava Lau, Zhiwen Shao, Dit-Yan Yeung

ICLR 2025, oral
8 citations

On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations

Guojun Xiong, Shufan Wang, Daniel Jiang et al.

ICLR 2025, oral, arXiv:2411.15014
3 citations

Physics-informed Temporal Difference Metric Learning for Robot Motion Planning

Ruiqi Ni, Zherong Pan, Ahmed Hussain Qureshi

ICLR 2025, oral, arXiv:2505.05691
2 citations

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Jie Cheng, Ruixi Qiao, Yingwei Ma et al.

ICLR 2025, oral, arXiv:2410.00564
7 citations

Temporal Difference Learning: Why It Can Be Fast and How It Will Be Faster

Patrick Schnell, Luca Guastoni, Nils Thuerey

ICLR 2025, oral

Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning

Jiuqi Wang, Ethan Blaser, Hadi Daneshmand et al.

ICLR 2025, oral, arXiv:2405.13861
14 citations

An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks

Zhifa Ke, Zaiwen Wen, Junyu Zhang

ICML 2024, oral

ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL

Yifei Zhou, Andrea Zanette, Jiayi Pan et al.

ICML 2024, oral

Discerning Temporal Difference Learning

AAAI 2024, paper, arXiv:2310.08091

Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation

Yudan Wang, Yue Wang, Yi Zhou et al.

ICML 2024, oral

Pausing Policy Learning in Non-stationary Reinforcement Learning

Hyunin Lee, Ming Jin, Javad Lavaei et al.

ICML 2024, oral