Xintao Wang

25 Papers
2,824 Total Citations
1 Affiliation

Affiliations

The Chinese University of Hong Kong

Papers (25)

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models

AAAI 2024 · arXiv
1,423
citations

Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos

AAAI 2024 · arXiv
276
citations

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

CVPR 2024
237
citations

BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

ECCV 2024 · arXiv
163
citations

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

CVPR 2024
139
citations

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

ICLR 2024
110
citations

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

CVPR 2024
109
citations

Improving Video Generation with Human Feedback

NeurIPS 2025
106
citations

DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing

CVPR 2024
89
citations

GameFactory: Creating New Games with Generative Interactive Videos

ICCV 2025
63
citations

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

ECCV 2024
50
citations

Image Conductor: Precision Control for Interactive Video Synthesis

AAAI 2025
46
citations

SketchVideo: Sketch-based Video Generation and Editing

CVPR 2025
8
citations

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution

CVPR 2025
3
citations

Anti-Diffusion: Preventing Abuse of Modifications of Diffusion-Based Models

AAAI 2025
2
citations

FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention

ICCV 2025
0
citations

Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

CVPR 2024
0
citations

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

ICCV 2025
0
citations

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

CVPR 2024
0
citations

Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis

CVPR 2024
0
citations

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

CVPR 2024
0
citations

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

CVPR 2024
0
citations

StyleMaster: Stylize Your Video with Artistic Generation and Translation

CVPR 2025
0
citations

Unifying Image Processing as Visual Prompting Question Answering

ICML 2024
0
citations

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

AAAI 2025
0
citations