"training data selection" Papers
3 papers found
DATE-LM: Benchmarking Data Attribution Evaluation for Large Language Models
Cathy Jiao, Yijun Pan, Emily Xiao et al.
NeurIPS 2025, poster, arXiv:2507.09424
DRoP: Distributionally Robust Data Pruning
Artem Vysogorets, Kartik Ahuja, Julia Kempe
ICLR 2025, poster, arXiv:2404.05579
4 citations
Understanding Data Influence in Reinforcement Finetuning
Haoru Tan, Xiuzhe Wu, Sitong Wu et al.
NeurIPS 2025, oral