by Alexander M Rush Papers
3 papers found
Conference
A Controlled Study on Long Context Extension and Generalization in LLMs
Yi Lu, Jing Nathan Yan, Songlin Yang et al.
COLM 2025paperarXiv:2409.12181
18
citations
Approximating Language Model Training Data from Weights
John Xavier Morris, Junjie Oscar Yin, Woojeong Kim et al.
COLM 2025paperarXiv:2506.15553
2
citations
Overfill: Two-Stage Models for Efficient Language Model Decoding
Woojeong Kim, Junxiong Wang, Jing Nathan Yan et al.
COLM 2025paperarXiv:2508.08446