Ganqu Cui

8

Papers

582

Total Citations

Papers (8)

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Advancing LLM Reasoning Generalists with Preference Trees

RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Scaling Physical Reasoning with the PHYSICS Dataset

ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback

Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models

A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks

Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations