Ganqu Cui
6
Papers
704
Total Citations
Papers (6)
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
CVPR 2024
344
citations
Advancing LLM Reasoning Generalists with Preference Trees
ICLR 2025arXiv
179
citations
TTRL: Test-Time Reinforcement Learning
NeurIPS 2025arXiv
122
citations
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
54
citations
Scaling Physical Reasoning with the PHYSICS Dataset
NeurIPS 2025
5
citations
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback
ICML 2024
0
citations