Poster "language model scaling" Papers
6 papers found
Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training
Woojin Chung, Jeonghoon Kim
NeurIPS 2025 | poster | arXiv:2508.15390
1 citation
Scaling Laws for Precision
Tanishq Kumar, Zachary Ankner, Benjamin Spector et al.
ICLR 2025 | poster | arXiv:2411.04330
65 citations
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Nikhil Sardana, Jacob Portes, Alexandre (Sasha) Doubov et al.
ICML 2024 | poster
Data Engineering for Scaling Language Models to 128K Context
Yao Fu, Rameswar Panda, Xinyao Niu et al.
ICML 2024 | poster
Mechanistic Design and Scaling of Hybrid Architectures
Michael Poli, Armin Thomas, Eric Nguyen et al.
ICML 2024 | poster
Why Larger Language Models Do In-context Learning Differently?
Zhenmei Shi, Junyi Wei, Zhuoyan Xu et al.
ICML 2024 | poster