CVPR 2025 "referring expression comprehension" Papers
4 papers found
Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels
Yongshuo Zong, Qin ZHANG, DONGSHENG An et al.
CVPR 2025posterarXiv:2505.13788
3
citations
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
Miran Heo, Min-Hung Chen, De-An Huang et al.
CVPR 2025posterarXiv:2501.08326
9
citations
Synthetic Visual Genome
Jae Sung Park, Zixian Ma, Linjie Li et al.
CVPR 2025posterarXiv:2506.07643
2
citations
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation
Silin Cheng, Yang Liu, Xinwei He et al.
CVPR 2025posterarXiv:2505.18686
3
citations