"transformer" Papers
3 papers found
Conference
Adaptive Computation Pruning for the Forgetting Transformer
Zhixuan Lin, Johan Obando-Ceron, Xu Owen He et al.
COLM 2025 · paper · arXiv:2504.06949
3 citations
KVSink: Understanding and Enhancing the Preservation of Attention Sinks in KV Cache Quantization for LLMs
Zunhai Su, Kehong Yuan
COLM 2025 · paper · arXiv:2508.04257
8 citations
Rethinking Associative Memory Mechanism in Induction Head
Shuo Wang, Issei Sato
COLM 2025 · paper