Poster papers by Keen You
4 papers found
Contrastive Localized Language-Image Pre-Training
Hong-You Chen, Zhengfeng Lai, Haotian Zhang et al.
ICML 2025, poster, arXiv:2410.02746
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms
Zhangheng LI, Keen You, Haotian Zhang et al.
ICLR 2025, poster, arXiv:2410.18967
43 citations
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang, Mingfei Gao, Zhe Gan et al.
ICLR 2025, poster, arXiv:2409.20566
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Keen You, Haotian Zhang, Eldon Schoop et al.
ECCV 2024, poster, arXiv:2404.05719
154 citations