"training-free compression" Papers
3 papers found
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
Xinghao Wang, Pengyu Wang, Bo Wang et al.
ICLR 2025, poster, arXiv:2410.23918
5 citations
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding
Minsoo Kim, Kyuhong Shim, Jungwook Choi et al.
NeurIPS 2025, oral, arXiv:2506.15745
12 citations
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference
Tanvir Mahmud, Burhaneddin Yaman, Chun-Hao Liu et al.
ECCV 2024, poster, arXiv:2403.16020
7 citations