Shangtong Zhang
7
Papers
30
Total Citations
Papers (7)
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
ICLR 2025
14
citations
Revisiting a Design Choice in Gradient Temporal Difference Learning
ICLR 2025
6
citations
Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set
ICML 2025
4
citations
Doubly Optimal Policy Evaluation for Reinforcement Learning
ICLR 2025
3
citations
Efficient Multi-Policy Evaluation for Reinforcement Learning
AAAI 2025
2
citations
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
ICLR 2025
1
citations
Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design
ICML 2024
0
citations