ICML "language model training" Papers
6 papers found
Better & Faster Large Language Models via Multi-token Prediction
Fabian Gloeckle, Badr Youbi Idrissi, Baptiste Roziere et al.
ICML 2024, poster, arXiv:2404.19737
DsDm: Model-Aware Dataset Selection with Datamodels
Logan Engstrom
ICML 2024, spotlight, arXiv:2401.12926
Fewer Truncations Improve Language Modeling
Hantian Ding, Zijian Wang, Giovanni Paolini et al.
ICML 2024, poster, arXiv:2404.10830
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
JoonHo Lee, Jae Oh Woo, Juree Seok et al.
ICML 2024, poster, arXiv:2405.06424
QuRating: Selecting High-Quality Data for Training Language Models
Alexander Wettig, Aatmik Gupta, Saumya Malik et al.
ICML 2024, spotlight, arXiv:2402.09739
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Ziniu Li, Tian Xu, Yushun Zhang et al.
ICML 2024, poster, arXiv:2310.10505