NEURIPS Oral "vision language models" Papers
2 papers found
DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models
Ziyi Wu, Anil Kag, Ivan Skorokhodov et al.
NEURIPS 2025, oral, arXiv:2506.03517
11 citations
MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation
Bohan Zhou, Yi Zhan, Zhongbin Zhang et al.
NEURIPS 2025, oral, arXiv:2505.16602
3 citations