Papers matching "scene understanding"
8 papers found
Conference
Diffusion Models for Attribution
Xiongren Chen, Jiuyong Li, Jixue Liu et al.
AAAI 2025, arXiv:2403.14790
12 citations
MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios
Jiacheng Ruan, Wenzhen Yuan, Zehao Lin et al.
AAAI 2025, arXiv:2409.16084
11 citations
Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision
Maoji Zheng, Ziyu Xu, Qiming Xia et al.
AAAI 2025, arXiv:2503.16811
3 citations
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios
Baichuan Zhou, Haote Yang, Dairong Chen et al.
AAAI 2025, arXiv:2408.17267
26 citations
Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
Wentao Mo, Yang Liu
AAAI 2024, arXiv:2402.15933
26 citations
GSN: Generalisable Segmentation in Neural Radiance Field
Vinayak Gupta, Rahul Goel, Sirikonda Dhawal et al.
AAAI 2024, arXiv:2402.04632
1 citation
ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding
Ziyang Lu, Yunqiang Pei, Guoqing Wang et al.
AAAI 2024, arXiv:2303.13186
12 citations
ViT-Calibrator: Decision Stream Calibration for Vision Transformer
Lin Chen, Zhijie Jia, Lechao Cheng et al.
AAAI 2024, arXiv:2304.04354
3 citations