Xintao Wang
46
Papers
2,824
Total Citations
1
Affiliations
The Chinese University of Hong Kong
Papers (46)
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models
AAAI 2024 arXiv
1,423
citations
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
AAAI 2024 arXiv
276
citations
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
CVPR 2024
237
citations
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
ECCV 2024 arXiv
163
citations
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
CVPR 2024
139
citations
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024
110
citations
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
CVPR 2024
109
citations
Improving Video Generation with Human Feedback
NeurIPS 2025
106
citations
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing
CVPR 2024
89
citations
GameFactory: Creating New Games with Generative Interactive Videos
ICCV 2025
63
citations
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
50
citations
Image Conductor: Precision Control for Interactive Video Synthesis
AAAI 2025
46
citations
SketchVideo: Sketch-based Video Generation and Editing
CVPR 2025
8
citations
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution
CVPR 2025
3
citations
Anti-Diffusion: Preventing Abuse of Modifications of Diffusion-Based Models
AAAI 2025
2
citations
Towards Real-World Blind Face Restoration With Generative Facial Prior
CVPR 2021 arXiv
0
citations
Robust Reference-Based Super-Resolution via C2-Matching
CVPR 2021 arXiv
0
citations
GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution
CVPR 2021 arXiv
0
citations
OSRT: Omnidirectional Image Super-Resolution With Distortion-Aware Transformer
CVPR 2023 arXiv
0
citations
Activating More Pixels in Image Super-Resolution Transformer
CVPR 2023 arXiv
0
citations
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
CVPR 2023 arXiv
0
citations
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
ICCV 2023
0
citations
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
ICCV 2023 arXiv
0
citations
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
ICCV 2023 arXiv
0
citations
Metric Learning Based Interactive Modulation for Real-World Super-Resolution
ECCV 2022
0
citations
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder
ECCV 2022
0
citations
Towards Vivid and Diverse Image Colorization With Generative Color Prior
ICCV 2021 arXiv
0
citations
StyleMaster: Stylize Your Video with Artistic Generation and Translation
CVPR 2025
0
citations
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
ICCV 2025
0
citations
FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention
ICCV 2025
0
citations
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
AAAI 2025
0
citations
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
CVPR 2024
0
citations
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
CVPR 2024
0
citations
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis
CVPR 2024
0
citations
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
CVPR 2024
0
citations
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
CVPR 2024
0
citations
Unifying Image Processing as Visual Prompting Question Answering
ICML 2024
0
citations
Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform
CVPR 2018 arXiv
0
citations
Deep Network Interpolation for Continuous Imagery Effect Transition
CVPR 2019
0
citations
BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond
CVPR 2021 arXiv
0
citations
Positional Encoding As Spatial Inductive Bias in GANs
CVPR 2021 arXiv
0
citations
Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution
NeurIPS 2021
0
citations
AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos
NeurIPS 2022
0
citations
Rethinking Alignment in Video Super-Resolution Transformers
NeurIPS 2022
0
citations
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
NeurIPS 2023
0
citations
Inserting Anybody in Diffusion Models via Celeb Basis
NeurIPS 2023
0
citations