Xiaodong Cun

29
Papers
981
Total Citations
1
Affiliations

Affiliations

Great Bay University

Papers (29)

Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos

AAAI 2024arXiv
276
citations

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

CVPR 2024
237
citations

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

CVPR 2024
139
citations

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

ICLR 2024
110
citations

DEIM: DETR with Improved Matching for Fast Convergence

CVPR 2025
93
citations

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

ECCV 2024
50
citations

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

CVPR 2025
44
citations

Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework

CVPR 2024
29
citations

Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models

ECCV 2024arXiv
3
citations

CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior

CVPR 2023arXiv
0
citations

SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

CVPR 2023arXiv
0
citations

Explicit Visual Prompting for Low-Level Structure Segmentations

CVPR 2023arXiv
0
citations

LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation

ICCV 2023arXiv
0
citations

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

ICCV 2023arXiv
0
citations

ToonTalker: Cross-Domain Face Reenactment

ICCV 2023arXiv
0
citations

High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net

ICCV 2023arXiv
0
citations

Defocus Blur Detection via Depth Distillation

ECCV 2020
0
citations

Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization

ECCV 2022
0
citations

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN

ECCV 2022
0
citations

Uformer: A General U-Shaped Transformer for Image Restoration

CVPR 2022
0
citations

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

CVPR 2025
0
citations

CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training

AAAI 2025
0
citations

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

CVPR 2024
0
citations

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

CVPR 2024
0
citations

Depth-aware Test-Time Training for Zero-shot Video Object Segmentation

CVPR 2024
0
citations

3D GAN Inversion With Facial Symmetry Prior

CVPR 2023arXiv
0
citations

Generating Human Motion From Textual Descriptions With Discrete Representations

CVPR 2023arXiv
0
citations

DPE: Disentanglement of Pose and Expression for General Video Portrait Editing

CVPR 2023arXiv
0
citations

Inserting Anybody in Diffusion Models via Celeb Basis

NeurIPS 2023
0
citations