Xinwei He
Papers: 3
Total Citations: 27
Papers (3)
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
ICCV 2025 (arXiv)
23 citations
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation
CVPR 2025
3 citations
Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval
ICCV 2025
1 citation