Hancheng Ye
3
Papers
20
Total Citations
Papers (3)
CoreMatching: A Co-adaptive Sparse Inference Framework with Token and Neuron Pruning for Comprehensive Acceleration of Vision-Language Models
ICML 2025
10
citations
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
CVPR 2024
9
citations
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
NeurIPS 2025arXiv
1
citations