Poster Papers by Michael Qizhe Shieh
5 papers found
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
Guanzheng Chen, Xin Li, Michael Qizhe Shieh et al.
ICLR 2025 | poster | arXiv:2502.13922
12 citations
MixEval-X: Any-to-any Evaluations from Real-world Data Mixture
Jinjie Ni, Yifan Song, Deepanway Ghosal et al.
ICLR 2025 | poster | arXiv:2410.13754
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
Xiangyan Liu, Jinjie Ni, Zijian Wu et al.
NeurIPS 2025 | poster | arXiv:2504.13055
The Emergence of Abstract Thought in Large Language Models Beyond Any Language
Yuxin Chen, Yiran Zhao, Yang Zhang et al.
NeurIPS 2025 | poster | arXiv:2506.09890
Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron
Yiran Zhao, Wenxuan Zhang, Yuxi Xie et al.
ICLR 2025 | poster