Weiming Ren
Papers: 3
Total Citations: 9
Papers (3)
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation
CVPR 2025, 9 citations
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
ICCV 2025, 0 citations
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024, 0 citations