Wangmeng Zuo

26
Papers
248
Total Citations

Papers (26)

GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection

ECCV 2024arXiv
57
citations

Improving Image Restoration through Removing Degradations in Textual Representations

CVPR 2024
45
citations

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis

CVPR 2024
23
citations

MC^2: Multi-concept Guidance for Customized Multi-concept Generation

CVPR 2025arXiv
21
citations

MV-VTON: Multi-View Virtual Try-On with Diffusion Models

AAAI 2025
20
citations

S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting

CVPR 2025
15
citations

Self-Supervised Video Desmoking for Laparoscopic Surgery

ECCV 2024
15
citations

Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning

NeurIPS 2025
12
citations

FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors

ICCV 2025
12
citations

TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

ICLR 2024
9
citations

ACE: Anti-Editing Concept Erasure in Text-to-Image Models

CVPR 2025
8
citations

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

ICCV 2025
5
citations

Integrating Visual Interpretation and Linguistic Reasoning for Geometric Problem Solving

ICCV 2025
3
citations

MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM

NeurIPS 2025arXiv
2
citations

Triad: Empowering LMM-based Anomaly Detection with Expert-guided Region-of-Interest Tokenizer and Manufacturing Process

ICCV 2025
1
citations

DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors

AAAI 2025
0
citations

Rethinking Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising

AAAI 2025
0
citations

QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation

ICCV 2025
0
citations

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

AAAI 2025
0
citations

Learning Real-World Image De-weathering with Imperfect Supervision

AAAI 2024
0
citations

3752 Decoupled Textual Embeddings for Customized Image Generation

AAAI 2024
0
citations

CASP: Consistency-aware Audio-induced Saliency Prediction Model for Omnidirectional Video

CVPR 2025
0
citations

Generative Inbetweening through Frame-wise Conditions-Driven Video Generation

CVPR 2025
0
citations

DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior

CVPR 2024
0
citations

ReMP-AD: Retrieval-enhanced Multi-modal Prompt Fusion for Few-Shot Industrial Visual Anomaly Detection

ICCV 2025
0
citations

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering

AAAI 2025
0
citations