ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

153citations
Project
153
Citations
#72
in CVPR 2024
of 2716 papers
7
Authors
1
Data Points

Citation History

Jan 28, 2026
153