Yong Jae Lee
10
Papers
180
Total Citations
Papers (10)
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
CVPR 2024
153
citations
X-Fusion: Introducing New Modality to Frozen Large Language Models
ICCV 2025
8
citations
Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
CVPR 2025arXiv
7
citations
Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
ECCV 2024
7
citations
Edit One for All: Interactive Batch Image Editing
CVPR 2024
5
citations
Improved Baselines with Visual Instruction Tuning
CVPR 2024
0
citations
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems
ICCV 2025
0
citations
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
ICCV 2025
0
citations
Customizing Domain Adapters for Domain Generalization
ICCV 2025
0
citations
Yo’Chameleon: Personalized Vision and Language Generation
CVPR 2025
0
citations