2025 "data selection" Papers
5 papers found
Efficient Top-m Data Values Identification for Data Selection
Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng et al.
ICLR 2025poster
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection
Bettina Messmer, Vinko Sabolčec, Martin Jaggi
NeurIPS 2025posterarXiv:2502.10361
10
citations
Graph Data Selection for Domain Adaptation: A Model-Free Approach
Ting-Wei Li, Ruizhong Qiu, Hanghang Tong
NeurIPS 2025posterarXiv:2505.17293
4
citations
Group-Level Data Selection for Efficient Pretraining
Zichun Yu, Fei Peng, Jie Lei et al.
NeurIPS 2025posterarXiv:2502.14709
1
citations
Reinforcement Learning-Guided Data Selection via Redundancy Assessment
Suorong Yang, Peijia Li, Furao Shen et al.
ICCV 2025posterarXiv:2506.21037
1
citations