"efficiency" Papers
3 papers found
Conference
Fluid Language Model Benchmarking
Valentin Hofmann, David Heineman, Ian Magnusson et al.
COLM 2025paperarXiv:2509.11106
10
citations
StagFormer: Time Staggering Decoder only Transformers
Dylan J Cutler, Arun Kandoor, Nishanth Dikkala et al.
COLM 2025paper
SuperBPE: Space Travel for Language Models
Alisa Liu, Jonathan Hayase, Valentin Hofmann et al.
COLM 2025paperarXiv:2503.13423
34
citations