Xintao Wang
25
Papers
2,824
Total Citations
1
Affiliations
Affiliations
The Chinese University of Hong Kong
Papers (25)
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion
AAAI 2024arXiv
1,423
citations
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
AAAI 2024arXiv
276
citations
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
CVPR 2024
237
citations
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
ECCV 2024arXiv
163
citations
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
CVPR 2024
139
citations
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024
110
citations
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
CVPR 2024
109
citations
Improving Video Generation with Human Feedback
NeurIPS 2025
106
citations
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing
CVPR 2024
89
citations
GameFactory: Creating New Games with Generative Interactive Videos
ICCV 2025
63
citations
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
50
citations
Image Conductor: Precision Control for Interactive Video Synthesis
AAAI 2025
46
citations
SketchVideo: Sketch-based Video Generation and Editing
CVPR 2025
8
citations
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution
CVPR 2025
3
citations
Anti-Diffusion: Preventing Abuse of Modifications of Diffusion-Based Models
AAAI 2025
2
citations
FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention
ICCV 2025
0
citations
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
CVPR 2024
0
citations
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
ICCV 2025
0
citations
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
CVPR 2024
0
citations
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis
CVPR 2024
0
citations
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
CVPR 2024
0
citations
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
CVPR 2024
0
citations
StyleMaster: Stylize Your Video with Artistic Generation and Translation
CVPR 2025
0
citations
Unifying Image Processing as Visual Prompting Question Answering
ICML 2024
0
citations
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
AAAI 2025
0
citations