Xia Hu
8 Papers
21 Total Citations
Papers (8)
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model
CVPR 2025, arXiv
20
citations
Flexible Group Count Enables Hassle-Free Structured Pruning
CVPR 2025
1
citation
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
ICML 2024
0
citations
TVE: Learning Meta-attribution for Transferable Vision Explainer
ICML 2024
0
citations
Soft Prompt Recovers Compressed LLMs, Transferably
ICML 2024
0
citations
GNNs Also Deserve Editing, and They Need It More Than Once
ICML 2024
0
citations
LLM Maybe LongLM: SelfExtend LLM Context Window Without Tuning
ICML 2024
0
citations
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts
CVPR 2024
0
citations