CVPR "multi-modal llms" Papers
2 papers found
Distilling Multi-modal Large Language Models for Autonomous Driving
Deepti Hegde, Rajeev Yasarla, Hong Cai et al.
CVPR 2025posterarXiv:2501.09757
27
citations
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Chaoyou Fu, Yuhan Dai, Yongdong Luo et al.
CVPR 2025highlightarXiv:2405.21075
876
citations