"referring audio-visual segmentation" Papers
2 papers found
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
Hao Zhong, Muzhi Zhu, Zongze Du et al.
NeurIPS 2025oralarXiv:2505.20256
12
citations
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Kaining Ying, Henghui Ding, Guangquan Jie et al.
ICCV 2025posterarXiv:2507.22886
5
citations