Xiaodong Cun
29
Papers
981
Total Citations
1
Affiliations
Great Bay University
Papers (29)
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
AAAI 2024
276
citations
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
CVPR 2024
237
citations
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
CVPR 2024
139
citations
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024
110
citations
DEIM: DETR with Improved Matching for Fast Convergence
CVPR 2025
93
citations
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
50
citations
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
CVPR 2025
44
citations
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
CVPR 2024
29
citations
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
ECCV 2024
3
citations
CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior
CVPR 2023
0
citations
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
CVPR 2023
0
citations
Explicit Visual Prompting for Low-Level Structure Segmentations
CVPR 2023
0
citations
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
ICCV 2023
0
citations
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
ICCV 2023
0
citations
ToonTalker: Cross-Domain Face Reenactment
ICCV 2023
0
citations
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
ICCV 2023
0
citations
Defocus Blur Detection via Depth Distillation
ECCV 2020
0
citations
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization
ECCV 2022
0
citations
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
ECCV 2022
0
citations
Uformer: A General U-Shaped Transformer for Image Restoration
CVPR 2022
0
citations
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
CVPR 2025
0
citations
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
AAAI 2025
0
citations
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
CVPR 2024
0
citations
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
CVPR 2024
0
citations
Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
CVPR 2024
0
citations
3D GAN Inversion With Facial Symmetry Prior
CVPR 2023
0
citations
Generating Human Motion From Textual Descriptions With Discrete Representations
CVPR 2023
0
citations
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
CVPR 2023
0
citations
Inserting Anybody in Diffusion Models via Celeb Basis
NeurIPS 2023
0
citations