Oral "reinforcement fine-tuning" Papers
3 papers found
EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT
Baoqi Pei, Yifei Huang, Jilan Xu et al.
NeurIPS 2025oralarXiv:2510.23569
4
citations
Understanding Data Influence in Reinforcement Finetuning
Haoru Tan, Xiuzhe Wu, Sitong Wu et al.
NeurIPS 2025oral
VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning
Qi Wang, Yanrui Yu, Ye Yuan et al.
NeurIPS 2025oralarXiv:2505.12434
30
citations