"reinforcement fine-tuning" Papers
6 papers found
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
Ran Xu, Yuchen Zhuang, Zihan Dong et al.
NeurIPS 2025spotlightarXiv:2509.24193
3
citations
EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT
Baoqi Pei, Yifei Huang, Jilan Xu et al.
NeurIPS 2025oralarXiv:2510.23569
4
citations
Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning
Jian Liu, Jing Xu, Song Guo et al.
NeurIPS 2025spotlightarXiv:2505.16761
7
citations
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
Enshen Zhou, Jingkun An, Cheng Chi et al.
NeurIPS 2025posterarXiv:2506.04308
51
citations
To Think or Not To Think: A Study of Thinking in Rule-Based Visual Reinforcement Fine-Tuning
Ming Li, Jike Zhong, Shitian Zhao et al.
NeurIPS 2025spotlight
Visual-RFT: Visual Reinforcement Fine-Tuning
Ziyu Liu, Zeyi Sun, Yuhang Zang et al.
ICCV 2025posterarXiv:2503.01785
347
citations