2025 Poster "multi-modal fusion" Papers
2 papers found
Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
He Zhu, Quyu Kong, Kechun Xu et al.
CVPR 2025posterarXiv:2504.04744
6
citations
Tri-MARF: A Tri-Modal Multi-Agent Responsive Framework for Comprehensive 3D Object Annotation
jusheng zhang, Yijia Fan, Zimo Wen et al.
NeurIPS 2025poster