Papers by Michael Qizhe Shieh
5 papers found
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
Guanzheng Chen, Xin Li, Michael Qizhe Shieh et al.
ICLR 2025 poster, arXiv:2502.13922
12 citations
MixEval-X: Any-to-any Evaluations from Real-world Data Mixture
Jinjie Ni, Yifan Song, Deepanway Ghosal et al.
ICLR 2025 poster
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
Xiangyan Liu, Jinjie Ni, Zijian Wu et al.
NeurIPS 2025 poster
The Emergence of Abstract Thought in Large Language Models Beyond Any Language
Yuxin Chen, Yiran Zhao, Yang Zhang et al.
NeurIPS 2025 poster
Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron
Yiran Zhao, Wenxuan Zhang, Yuxi Xie et al.
ICLR 2025 poster