Yuhao Dong
9
Papers
41
Total Citations
Papers (9)
Efficient Inference of Vision Instruction-Following Models with Elastic Cache
ECCV 2024
25
citations
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model
CVPR 2025arXiv
9
citations
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models
NeurIPS 2025
7
citations
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data and Metric Perspectives
ICCV 2025
0
citations
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding
ICCV 2025
0
citations
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
CVPR 2025
0
citations
Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning
CVPR 2023arXiv
0
citations
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
CVPR 2025
0
citations
EgoLife: Towards Egocentric Life Assistant
CVPR 2025
0
citations