"reinforcement learning framework" Papers
3 papers found
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning
Di Zhang, Jingdi Lei, Junxian Li et al.
CVPR 2025, poster, arXiv:2411.18203
30 citations
Train on Pins and Test on Obstacles for Rectilinear Steiner Minimum Tree
Xingbo Du, Ruizhe Zhong, Junchi Yan
NeurIPS 2025, poster
Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning
Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.
AAAI 2024, paper, arXiv:2308.07272
7 citations