"long-horizon video generation" Papers
2 papers found
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Mariam Hassan, Sebastian Stapf, Ahmad Rahimi et al.
CVPR 2025posterarXiv:2412.11198
41
citations
WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception
Zhiheng Liu, Xueqing Deng, Shoufa Chen et al.
NeurIPS 2025oralarXiv:2508.15720
5
citations