Poster Papers by Shixuan Liu
2 papers found
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Shenzhi Wang, Le Yu, Chang Gao et al.
NeurIPS 2025, poster, arXiv:2506.01939
RoME: Domain-Robust Mixture-of-Experts for MILP Solution Prediction across Domains
Tianle Pu, Zijie Geng, Haoyang Liu et al.
NeurIPS 2025, poster, arXiv:2511.02331