Wengang Zhou

57
Papers
58
Total Citations

Papers (57)

Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

AAAI 2025
22
citations

DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

CVPR 2025
19
citations

SmartEraser: Remove Anything from Images using Masked-Region Guidance

CVPR 2025
12
citations

I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models

CVPR 2025
3
citations

Image as a World: Generating Interactive World from Single Image via Panoramic Video Generation

NeurIPS 2025
1
citations

Revisiting Open-Set Panoptic Segmentation

AAAI 2024
1
citations

Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution

CVPR 2024
0
citations

Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

ICML 2024
0
citations

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

CVPR 2025
0
citations

Picking Deep Filter Responses for Fine-Grained Image Recognition

CVPR 2016
0
citations

Multi-Cue Correlation Filters for Robust Visual Tracking

CVPR 2018
0
citations

Unsupervised Deep Tracking

CVPR 2019
0
citations

Iterative Alignment Network for Continuous Sign Language Recognition

CVPR 2019
0
citations

Transformation GAN for Unsupervised Image Synthesis and Representation Learning

CVPR 2020
0
citations

Improving Sign Language Translation With Monolingual Data by Sign Back-Translation

CVPR 2021arXiv
0
citations

Model-Aware Gesture-to-Gesture Translation

CVPR 2021
0
citations

ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation

CVPR 2021
0
citations

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

CVPR 2021arXiv
0
citations

Uformer: A General U-Shaped Transformer for Image Restoration

CVPR 2022
0
citations

Contextual Similarity Distillation for Asymmetric Image Retrieval

CVPR 2022
0
citations

Domain-Agnostic Prior for Transfer Semantic Segmentation

CVPR 2022arXiv
0
citations

AnchorFormer: Point Cloud Completion From Discriminative Nodes

CVPR 2023
0
citations

Asymmetric Feature Fusion for Image Retrieval

CVPR 2023
0
citations

AltFreezing for More General Video Face Forgery Detection

CVPR 2023
0
citations

HandNeRF: Neural Radiance Fields for Animatable Interacting Hands

CVPR 2023arXiv
0
citations

Relation Distillation Networks for Video Object Detection

ICCV 2019
0
citations

Joint Inductive and Transductive Learning for Video Object Segmentation

ICCV 2021arXiv
0
citations

SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition

ICCV 2021arXiv
0
citations

Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation

ICCV 2021arXiv
0
citations

Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval

ICCV 2021
0
citations

TransVG: End-to-End Visual Grounding With Transformers

ICCV 2021arXiv
0
citations

Sign Language Translation with Iterative Prototype

ICCV 2023arXiv
0
citations

DIRE for Diffusion-Generated Image Detection

ICCV 2023arXiv
0
citations

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

ICCV 2023arXiv
0
citations

Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation

ICCV 2023arXiv
0
citations

Masked Motion Predictors are Strong 3D Action Representation Learners

ICCV 2023arXiv
0
citations

Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

ICCV 2023arXiv
0
citations

Wavelet-Based Dual-Branch Network for Image Demoiréing

ECCV 2020
0
citations

CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation

ECCV 2022
0
citations

TAPE: Task-Agnostic Prior Embedding for Image Restoration

ECCV 2022
0
citations

CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds

ECCV 2022
0
citations

MVP: Multimodality-Guided Visual Pre-training

ECCV 2022
0
citations

Geometric Representation Learning for Document Image Rectification

ECCV 2022
0
citations

SOM: Semantic Obviousness Metric for Image Quality Assessment

CVPR 2015
0
citations

OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation

CVPR 2025
0
citations

Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments

ICCV 2025
0
citations

Aligning Global Semantics and Local Textures in Generative Video Enhancement

ICCV 2025
0
citations

SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning

AAAI 2024
0
citations

Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation

CVPR 2024
0
citations

Contextual Similarity Aggregation with Self-attention for Visual Re-ranking

NeurIPS 2021
0
citations

LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning

NeurIPS 2022
0
citations

Hand-Object Interaction Image Generation

NeurIPS 2022
0
citations

Multi-Agent First Order Constrained Optimization in Policy Space

NeurIPS 2023
0
citations

CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection

NeurIPS 2023
0
citations

Hierarchical Multi-Agent Skill Discovery

NeurIPS 2023
0
citations

State Sequences Prediction via Fourier Transform for Representation Learning

NeurIPS 2023
0
citations

DIFFER:Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning

NeurIPS 2023
0
citations