Weiming Ren
4
Papers
97
Total Citations
Papers (4)
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
ICLR 2025arXiv
88
citations
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation
CVPR 2025
9
citations
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
ICCV 2025
0
citations
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
0
citations