Poster "weight quantization" Papers
5 papers found
Cauchy-Schwarz Regularizers
Sueda Taner, Ziyi Wang, Christoph Studer
ICLR 2025posterarXiv:2503.01639
Training-Free Activation Sparsity in Large Language Models
James Liu, Pragaash Ponnusamy, Tianle Cai et al.
ICLR 2025posterarXiv:2408.14690
37
citations
A2Q+: Improving Accumulator-Aware Weight Quantization
Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig et al.
ICML 2024poster
ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking
Wenshuo Li, Xinghao Chen, Han Shu et al.
ICML 2024poster
Extreme Compression of Large Language Models via Additive Quantization
Vage Egiazarian, Andrei Panferov, Denis Kuznedelev et al.
ICML 2024poster