ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Citations: 153
Authors: 7
Data points: 1

Citation History

Jan 28, 2026: 153 citations