Shihan Dou
4 Papers
3 Total Citations
Papers (4)
EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving
NeurIPS 2025
3 citations
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning
AAAI 2025
0 citations
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
ICML 2024
0 citations
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
ICML 2024
0 citations