Jianyi Zhang
3
Papers
11
Total Citations
Papers (3)
CoreMatching: A Co-adaptive Sparse Inference Framework with Token and Neuron Pruning for Comprehensive Acceleration of Vision-Language Models
ICML 2025
10
citations
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
NeurIPS 2025arXiv
1
citations
Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing
ICCV 2025
0
citations