Dahun Kim
3
Papers
31
Total Citations
Papers (3)
Mirasol3B: A Multimodal Autoregressive Model for Time-Aligned and Contextual Modalities
CVPR 2024
25
citations
Region-centric Image-Language Pretraining for Open-Vocabulary Detection
ECCV 2024arXiv
6
citations
VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models
CVPR 2025
0
citations