Yiyi Zhou
8
Papers
98
Total Citations
Papers (8)
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models
AAAI 2025
52
citations
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
AAAI 2024arXiv
19
citations
What Kind of Visual Tokens Do We Need? Training-Free Visual Token Pruning for Multi-Modal Large Language Models from the Perspective of Graph
AAAI 2025
18
citations
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings
NeurIPS 2025arXiv
5
citations
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression
CVPR 2025
4
citations
SVFR: A Unified Framework for Generalized Video Face Restoration
CVPR 2025
0
citations
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension
CVPR 2025
0
citations
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
ICML 2024
0
citations