Shuhuai Ren
5
Papers
1,280
Total Citations
Papers (5)
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
CVPR 2025
858
citations
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
CVPR 2024
356
citations
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
ECCV 2024
49
citations
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
ICCV 2025
17
citations
Parallelized Autoregressive Visual Generation
CVPR 2025
0
citations