2025 Poster "dataset curation" Papers
3 papers found
Filter Like You Test: Data-Driven Data Filtering for CLIP Pretraining
Mikey Shechter, Yair Carmon
NeurIPS 2025, poster, arXiv:2503.08805
1 citation
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman, Noam Rotstein, Roy Ganz et al.
CVPR 2025, poster, arXiv:2404.18212
29 citations
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
Nikhil Kandpal, Brian Lester, Colin Raffel et al.
NeurIPS 2025, poster, arXiv:2506.05209
10 citations