Weihan Wang
7
Papers
1,559
Total Citations
Papers (7)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
1,318
citations
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
208
citations
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
ICLR 2025arXiv
33
citations
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
CVPR 2025
0
citations
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
0
citations
Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation
CVPR 2023arXiv
0
citations
ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation
ICCV 2023arXiv
0
citations