ICLR "language model training" Papers
3 papers found
Aioli: A Unified Optimization Framework for Language Model Data Mixing
Mayee Chen, Michael Hu, Nicholas Lourie et al.
ICLR 2025posterarXiv:2411.05735
16
citations
Generative Representational Instruction Tuning
Niklas Muennighoff, Hongjin SU, Liang Wang et al.
ICLR 2025posterarXiv:2402.09906
214
citations
Learning from negative feedback, or positive feedback or both
Abbas Abdolmaleki, Bilal Piot, Bobak Shahriari et al.
ICLR 2025posterarXiv:2410.04166
7
citations