Papers by Akshita Bhagia
4 papers found
DataDecide: How to Predict Best Pretraining Data with Small Experiments
Ian Magnusson, Tai Nguyen, Ben Bogin et al.
ICML 2025 poster
FlexOLMo: Open Language Models for Flexible Data Use
Weijia Shi, Akshita Bhagia, Kevin Farhat et al.
NeurIPS 2025 spotlight
OLMoE: Open Mixture-of-Experts Language Models
Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld et al.
ICLR 2025 poster
What's In My Big Data?
Yanai Elazar, Akshita Bhagia, Ian Magnusson et al.
ICLR 2024 spotlight