2024 Poster "multimodal reasoning" Papers
3 papers found
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang, Xiaoyang Wu, Xi Chen et al.
ECCV 2024posterarXiv:2309.00616
82
citations
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu, Junting Chen, Qing-Long Zhang et al.
ICML 2024poster
Vamos: Versatile Action Models for Video Understanding
Shijie Wang, Qi Zhao, Minh Quan et al.
ECCV 2024posterarXiv:2311.13627
36
citations