Weiming Ren

3

Papers

9

Total Citations

Papers (3)

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI