Xintao Wang

46
Papers
2,824
Total Citations
1
Affiliations

Affiliations

The Chinese University of Hong Kong

Papers (46)

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion

AAAI 2024, arXiv
1,423
citations

Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos

AAAI 2024, arXiv
276
citations

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

CVPR 2024
237
citations

BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

ECCV 2024, arXiv
163
citations

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

CVPR 2024
139
citations

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

ICLR 2024
110
citations

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

CVPR 2024
109
citations

Improving Video Generation with Human Feedback

NeurIPS 2025
106
citations

DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing

CVPR 2024
89
citations

GameFactory: Creating New Games with Generative Interactive Videos

ICCV 2025
63
citations

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

ECCV 2024
50
citations

Image Conductor: Precision Control for Interactive Video Synthesis

AAAI 2025
46
citations

SketchVideo: Sketch-based Video Generation and Editing

CVPR 2025
8
citations

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution

CVPR 2025
3
citations

Anti-Diffusion: Preventing Abuse of Modifications of Diffusion-Based Models

AAAI 2025
2
citations

Towards Real-World Blind Face Restoration With Generative Facial Prior

CVPR 2021, arXiv
0
citations

Robust Reference-Based Super-Resolution via C2-Matching

CVPR 2021, arXiv
0
citations

GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution

CVPR 2021, arXiv
0
citations

OSRT: Omnidirectional Image Super-Resolution With Distortion-Aware Transformer

CVPR 2023, arXiv
0
citations

Activating More Pixels in Image Super-Resolution Transformer

CVPR 2023, arXiv
0
citations

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models

CVPR 2023, arXiv
0
citations

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

ICCV 2023
0
citations

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing

ICCV 2023, arXiv
0
citations

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

ICCV 2023, arXiv
0
citations

Metric Learning Based Interactive Modulation for Real-World Super-Resolution

ECCV 2022
0
citations

VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

ECCV 2022
0
citations

Towards Vivid and Diverse Image Colorization With Generative Color Prior

ICCV 2021, arXiv
0
citations

StyleMaster: Stylize Your Video with Artistic Generation and Translation

CVPR 2025
0
citations

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

ICCV 2025
0
citations

FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention

ICCV 2025
0
citations

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

AAAI 2025
0
citations

Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

CVPR 2024
0
citations

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

CVPR 2024
0
citations

Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis

CVPR 2024
0
citations

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

CVPR 2024
0
citations

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

CVPR 2024
0
citations

Unifying Image Processing as Visual Prompting Question Answering

ICML 2024
0
citations

Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform

CVPR 2018, arXiv
0
citations

Deep Network Interpolation for Continuous Imagery Effect Transition

CVPR 2019
0
citations

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

CVPR 2021, arXiv
0
citations

Positional Encoding As Spatial Inductive Bias in GANs

CVPR 2021, arXiv
0
citations

Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution

NeurIPS 2021
0
citations

AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos

NeurIPS 2022
0
citations

Rethinking Alignment in Video Super-Resolution Transformers

NeurIPS 2022
0
citations

Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

NeurIPS 2023
0
citations

Inserting Anybody in Diffusion Models via Celeb Basis

NeurIPS 2023
0
citations