Yuxiao Dong
14
Papers
1,689
Total Citations
Papers (14)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
1,318
citations
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
208
citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
67
citations
Scaling Speech-Text Pre-training with Synthetic Interleaved Data
ICLR 2025
39
citations
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
ICLR 2025, arXiv
33
citations
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
ICLR 2025
12
citations
TriSampler: A Better Negative Sampling Principle for Dense Retrieval
AAAI 2024, arXiv
12
citations
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
CVPR 2025
0
citations
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
0
citations
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
0
citations
Graph Random Neural Networks for Semi-Supervised Learning on Graphs
NeurIPS 2020
0
citations
Open Graph Benchmark: Datasets for Machine Learning on Graphs
NeurIPS 2020
0
citations
Adaptive Diffusion in Graph Neural Networks
NeurIPS 2021
0
citations
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
NeurIPS 2023
0
citations