Ganqu Cui
8
Papers
582
Total Citations
Papers (8)
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
CVPR 2024
344
citations
Advancing LLM Reasoning Generalists with Preference Trees
ICLR 2025arXiv
179
citations
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
54
citations
Scaling Physical Reasoning with the PHYSICS Dataset
NeurIPS 2025
5
citations
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback
ICML 2024
0
citations
Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models
NeurIPS 2022
0
citations
A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks
NeurIPS 2022
0
citations
Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations
NeurIPS 2023
0
citations