Rui Zheng
4 Papers
0 Total Citations
Papers (4)
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
CVPR 2025
0 citations
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning
AAAI 2025
0 citations
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
ICML 2024
0 citations
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
ICML 2024
0 citations