NeurIPS Oral "reinforcement learning" Papers
6 papers found
Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
Feichen Gan, Lu Youcun, Yingying Zhang et al.
NeurIPS 2025, oral, arXiv:2510.26026
Heterogeneous Graph Transformers for Simultaneous Mobile Multi-Robot Task Allocation and Scheduling under Temporal Constraints
Batuhan Altundas, Shengkang Chen, Shivika Singh et al.
NeurIPS 2025, oral
Meta-learning how to Share Credit among Macro-Actions
Ionel-Alexandru Hosu, Traian Rebedea, Razvan Pascanu
NeurIPS 2025, oral, arXiv:2506.13690
No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
Jasmine Bayrooti, Sattar Vakili, Amanda Prorok et al.
NeurIPS 2025, oral, arXiv:2510.20725
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
Hao Zhong, Muzhi Zhu, Zongze Du et al.
NeurIPS 2025, oral, arXiv:2505.20256
12 citations
VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception
Ziang Yan, Yinan He, Xinhao Li et al.
NeurIPS 2025, oral, arXiv:2509.21100
13 citations