Mu Cai
5
Papers
218
Total Citations
Papers (5)
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
CVPR 2024
153
citations
Matryoshka Multimodal Models
ICLR 2025arXiv
58
citations
Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
ECCV 2024
7
citations
Magma: A Foundation Model for Multimodal AI Agents
CVPR 2025
0
citations
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
ICCV 2025
0
citations