Jilan Xu
10
Papers
1,016
Total Citations
Papers (10)
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
CVPR 2024
864
citations
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
CVPR 2024
84
citations
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
ICLR 2025
39
citations
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
ICLR 2025
11
citations
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
NeurIPS 2025
10
citations
AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation
NeurIPS 2025
5
citations
Learning Streaming Video Representation via Multitask Training
ICCV 2025
3
citations
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision
CVPR 2023arXiv
0
citations
Retrieval-Augmented Egocentric Video Captioning
CVPR 2024
0
citations
CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping
CVPR 2022arXiv
0
citations