CVPR 2025 "model compression" Papers
5 papers found
CASP: Compression of Large Multimodal Models Based on Attention Sparsity
Mohsen Gholami, Mohammad Akbari, Kevin Cannons et al.
CVPR 2025highlightarXiv:2503.05936
2
citations
DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models
Yongqi Huang, Peng Ye, Chenyu Huang et al.
CVPR 2025posterarXiv:2503.01359
6
citations
EdgeTAM: On-Device Track Anything Model
Chong Zhou, Chenchen Zhu, Yunyang Xiong et al.
CVPR 2025posterarXiv:2501.07256
8
citations
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation
Zhuguanyu Wu, Shihe Wang, Jiayi Zhang et al.
CVPR 2025highlightarXiv:2506.11543
5
citations
Quantization without Tears
Minghao Fu, Hao Yu, Jie Shao et al.
CVPR 2025posterarXiv:2411.13918
14
citations