2025 Poster Papers by Haobin Lin
2 papers found
Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining
Ping Guo, Yubing Ren, Binbin Liu et al.
NeurIPS 2025 | poster | arXiv:2509.15556
1 citation
MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining
Zhixun Chen, Ping Guo, Wenhan Han et al.
NeurIPS 2025 | poster | arXiv:2507.01785