ICLR 2025 "language model training" Papers
6 papers found
Aioli: A Unified Optimization Framework for Language Model Data Mixing
Mayee Chen, Michael Hu, Nicholas Lourie et al.
ICLR 2025posterarXiv:2411.05735
16
citations
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models
Rosie Zhao, Depen Morwani, David Brandfonbrener et al.
ICLR 2025poster
37
citations
Generative Representational Instruction Tuning
Niklas Muennighoff, Hongjin SU, Liang Wang et al.
ICLR 2025posterarXiv:2402.09906
217
citations
Gradient descent with generalized Newton’s method
Zhiqi Bu, Shiyun Xu
ICLR 2025posterarXiv:2407.02772
6
citations
Inverse Scaling: When Bigger Isn't Better
Joe Cavanagh, Andrew Gritsevskiy, Najoung Kim et al.
ICLR 2025posterarXiv:2306.09479
183
citations
Learning from negative feedback, or positive feedback or both
Abbas Abdolmaleki, Bilal Piot, Bobak Shahriari et al.
ICLR 2025posterarXiv:2410.04166
7
citations