Paper "inference acceleration" Papers
2 papers found
Expediting Contrastive Language-Image Pretraining via Self-Distilled Encoders
Bumsoo Kim, Jinhyung Kim, Yeonsik Jo et al.
AAAI 2024paperarXiv:2312.12659
5
citations
Fluctuation-Based Adaptive Structured Pruning for Large Language Models
Yongqi An, Xu Zhao, Tao Yu et al.
AAAI 2024paperarXiv:2312.11983
96
citations