Poster by Haobin Lin Papers
2 papers found
Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining
Ping Guo, Yubing Ren, BINBINLIU et al.
NeurIPS 2025posterarXiv:2509.15556
1
citations
MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining
Zhixun Chen, Ping Guo, Wenhan Han et al.
NeurIPS 2025poster