2024 "language model training" Papers
6 papers found
Better & Faster Large Language Models via Multi-token Prediction
Fabian Gloeckle, Badr Youbi Idrissi, Baptiste Roziere et al.
ICML 2024poster
DsDm: Model-Aware Dataset Selection with Datamodels
Logan Engstrom
ICML 2024spotlight
Fewer Truncations Improve Language Modeling
Hantian Ding, Zijian Wang, Giovanni Paolini et al.
ICML 2024poster
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
JoonHo Lee, Jae Oh Woo, Juree Seok et al.
ICML 2024poster
QuRating: Selecting High-Quality Data for Training Language Models
Alexander Wettig, Aatmik Gupta, Saumya Malik et al.
ICML 2024spotlight
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Ziniu Li, Tian Xu, Yushun Zhang et al.
ICML 2024poster