ICCV 2025 "vision-language tasks" Papers
3 papers found
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding
Tatiana Zemskova, Dmitry Yudin
ICCV 2025 · poster · arXiv:2412.18450
11 citations
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences
Hyojin Bahng, Caroline Chan, Fredo Durand et al.
ICCV 2025 · poster · arXiv:2506.02095
7 citations
MIEB: Massive Image Embedding Benchmark
Chenghao Xiao, Isaac Chung, Imene Kerboua et al.
ICCV 2025 · poster · arXiv:2504.10471
6 citations