Jilan Xu
8
Papers
1,018
Total Citations
Papers (8)
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
CVPR 2024
864
citations
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
CVPR 2024
84
citations
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
ICLR 2025, arXiv
41
citations
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
ICLR 2025
11
citations
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
NeurIPS 2025
10
citations
AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation
NeurIPS 2025
5
citations
Learning Streaming Video Representation via Multitask Training
ICCV 2025
3
citations
Retrieval-Augmented Egocentric Video Captioning
CVPR 2024
0
citations