Oral "scene understanding" Papers
6 papers found
Conference
CSV-Occ: Fusing Multi-frame Alignment for Occupancy Prediction with Temporal Cross State Space Model and Central Voting Mechanism
Ziming Zhu, Yu Zhu, Jiahao Chen et al.
ICML 2025oral
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
Xiaosong Jia, Junqi You, Zhiyuan Zhang et al.
ICLR 2025oralarXiv:2503.07656
70
citations
Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling
Tianyi Tan, Yinan Zheng, Ruiming Liang et al.
NEURIPS 2025oralarXiv:2510.11083
5
citations
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Fei Wang, XINGYU FU, James Y. Huang et al.
ICLR 2025oralarXiv:2406.09411
120
citations
Multi-scale Temporal Prediction via Incremental Generation and Multi-agent Collaboration
Zhitao Zeng, Guojian Yuan, Junyuan Mao et al.
NEURIPS 2025oralarXiv:2509.17429
Spiking Vision Transformer with Saccadic Attention
Shuai Wang, Malu Zhang, Dehao Zhang et al.
ICLR 2025oralarXiv:2502.12677
17
citations