Ming Ding
5
Papers
1,626
Total Citations
23
h-index
Papers (5)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
1,318
citations
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
208
citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
67
citations
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
ICLR 2025arXiv
33
citations
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
0
citations