2025 "reinforcement learning training" Papers
5 papers found
Adaptive teachers for amortized samplers
Minsu Kim, Sanghyeok Choi, Taeyoung Yun et al.
ICLR 2025posterarXiv:2410.01432
15
citations
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He, Peng Qiao, Rongchun Li et al.
AAAI 2025paperarXiv:2502.20190
PABBO: Preferential Amortized Black-Box Optimization
Xinyu Zhang, Daolang Huang, Samuel Kaski et al.
ICLR 2025posterarXiv:2503.00924
4
citations
Seeing the Arrow of Time in Large Multimodal Models
Zihui (Sherry) Xue, Romy Luo, Kristen Grauman
NEURIPS 2025oralarXiv:2506.03340
5
citations
Training a Scientific Reasoning Model for Chemistry
Siddharth Narayanan, James Braza, Ryan-Rhys Griffiths et al.
NEURIPS 2025posterarXiv:2506.17238
22
citations