Paper "inference acceleration" Papers
5 papers found
Conference
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang, Zuxuan Wu, Zhen Xing et al.
AAAI 2025paperarXiv:2311.14768
20
citations
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity
Chenyang Song, Weilin Zhao, Xu Han et al.
COLM 2025paperarXiv:2507.08771
1
citations
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
Xuan Shen, Zhao Song, Yufa Zhou et al.
AAAI 2025paperarXiv:2412.12444
38
citations
Expediting Contrastive Language-Image Pretraining via Self-Distilled Encoders
Bumsoo Kim, Jinhyung Kim, Yeonsik Jo et al.
AAAI 2024paperarXiv:2312.12659
5
citations
Fluctuation-Based Adaptive Structured Pruning for Large Language Models
Yongqi An, Xu Zhao, Tao Yu et al.
AAAI 2024paperarXiv:2312.11983
106
citations