"on-policy learning" Papers
2 papers found
Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization
Sascha Marton, Tim Grams, Florian Vogt et al.
ICLR 2025 poster, arXiv:2408.08761
4 citations
Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound
Tal Fiskus, Uri Shaham
NeurIPS 2025 poster, arXiv:2507.11269