ICCV "multimodal reasoning" Papers
3 papers found
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning
Tianhong Gao, Yannian Fu, Weiqun Wu et al.
ICCV 2025posterarXiv:2507.21924
1
citations
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Kaining Ying, Henghui Ding, Guangquan Jie et al.
ICCV 2025posterarXiv:2507.22886
5
citations
ViLLa: Video Reasoning Segmentation with Large Language Model
rongkun Zheng, Lu Qi, Xi Chen et al.
ICCV 2025posterarXiv:2407.14500
16
citations