ICLR 2025 "layer pruning" Papers
2 papers found
Streamlining Redundant Layers to Compress Large Language Models
Xiaodong Chen, Yuxuan Hu, Jing Zhang et al.
ICLR 2025posterarXiv:2403.19135
15
citations
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov, Kushal Tirumala, Hassan Shapourian et al.
ICLR 2025posterarXiv:2403.17887
160
citations