Poster "multimodal video understanding" Papers
2 papers found
ConViS-Bench: Estimating Video Similarity Through Semantic Concepts
Benedetta Liberatori, Alessandro Conti, Lorenzo Vaquero et al.
NeurIPS 2025, poster, arXiv:2509.19245
1 citation
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Xuehai He, Weixi Feng, Kaizhi Zheng et al.
ICLR 2025, poster, arXiv:2406.08407
34 citations