2024 "model pruning" Papers
10 papers found
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
Mohammed Nowaz Rabbani Chowdhury, Meng Wang, Kaoutar El Maghraoui et al.
ICML 2024poster
COPAL: Continual Pruning in Large Language Generative Models
Srikanth Malla, Joon Hee Choi, Chiho Choi
ICML 2024poster
FedLPS: Heterogeneous Federated Learning for Multiple Tasks with Local Parameter Sharing
Yongzhe Jia, Xuyun Zhang, Amin Beheshti et al.
AAAI 2024paperarXiv:2402.08578
10
citations
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
Hongkang Li, Meng Wang, Songtao Lu et al.
ICML 2024poster
Junk DNA Hypothesis: Pruning Small Pre-Trained Weights $\textit{Irreversibly}$ and $\textit{Monotonically}$ Impairs ``Difficult" Downstream Tasks in LLMs
Lu Yin, Ajay Jaiswal, Shiwei Liu et al.
ICML 2024poster
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh, Yan Kang, Yuchen Liu et al.
ECCV 2024posterarXiv:2409.15557
12
citations
Non-transferable Pruning
Ruyi Ding, Lili Su, A. Adam Ding et al.
ECCV 2024posterarXiv:2410.08015
3
citations
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
Lu Yin, You Wu, Zhenyu Zhang et al.
ICML 2024poster
Sparsest Models Elude Pruning: An Exposé of Pruning’s Current Capabilities
Stephen Zhang, Vardan Papyan
ICML 2024poster
Unveiling the Dynamics of Information Interplay in Supervised Learning
Kun Song, Zhiquan Tan, Bochao Zou et al.
ICML 2024poster