2025 "dynamic scene understanding" Papers
9 papers found
Layered Motion Fusion: Lifting Motion Segmentation to 3D in Egocentric Videos
Vadim Tschernezki, Diane Larlus, Andrea Vedaldi et al.
CVPR 2025posterarXiv:2506.05546
MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning
Mohammadreza Salehi, Shashanka Venkataramanan, Ioana Simion et al.
ICCV 2025posterarXiv:2506.08694
1
citations
RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video
ShuHang Xun, Sicheng Tao, Jungang Li et al.
NEURIPS 2025posterarXiv:2505.02064
5
citations
SAMPO: Scale-wise Autoregression with Motion Prompt for Generative World Models
Sen Wang, Jingyi Tian, Le Wang et al.
NEURIPS 2025oral
SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing
Mingfei Chen, Zijun Cui, Xiulong Liu et al.
NEURIPS 2025oralarXiv:2506.05414
5
citations
Situat3DChange: Situated 3D Change Understanding Dataset for Multimodal Large Language Model
Ruiping Liu, Junwei Zheng, Yufan Chen et al.
NEURIPS 2025posterarXiv:2510.11509
Track3R: Joint Point Map and Trajectory Prior for Spatiotemporal 3D Understanding
Seong Hyeon Park, Jinwoo Shin
NEURIPS 2025oral
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
David Yifan Yao, Albert J. Zhai, Shenlong Wang
CVPR 2025highlightarXiv:2503.21761
14
citations
VLM4D: Towards Spatiotemporal Awareness in Vision Language Models
Shijie Zhou, Alexander Vilesov, Xuehai He et al.
ICCV 2025posterarXiv:2508.02095
15
citations