2025 "training-free compression" Papers
2 papers found
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
Xinghao Wang, Pengyu Wang, Bo Wang et al.
ICLR 2025posterarXiv:2410.23918
5
citations
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding
Minsoo Kim, Kyuhong Shim, Jungwook Choi et al.
NEURIPS 2025oralarXiv:2506.15745
12
citations