Wengang Zhou
57
Papers
58
Total Citations
Papers (57)
Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models
AAAI 2025
22
citations
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
CVPR 2025
19
citations
SmartEraser: Remove Anything from Images using Masked-Region Guidance
CVPR 2025
12
citations
I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models
CVPR 2025
3
citations
Image as a World: Generating Interactive World from Single Image via Panoramic Video Generation
NeurIPS 2025
1
citations
Revisiting Open-Set Panoptic Segmentation
AAAI 2024
1
citations
Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution
CVPR 2024
0
citations
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
ICML 2024
0
citations
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
CVPR 2025arXiv
0
citations
Picking Deep Filter Responses for Fine-Grained Image Recognition
CVPR 2016
0
citations
Multi-Cue Correlation Filters for Robust Visual Tracking
CVPR 2018
0
citations
Unsupervised Deep Tracking
CVPR 2019
0
citations
Iterative Alignment Network for Continuous Sign Language Recognition
CVPR 2019
0
citations
Transformation GAN for Unsupervised Image Synthesis and Representation Learning
CVPR 2020
0
citations
Improving Sign Language Translation With Monolingual Data by Sign Back-Translation
CVPR 2021arXiv
0
citations
Model-Aware Gesture-to-Gesture Translation
CVPR 2021
0
citations
ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation
CVPR 2021
0
citations
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
CVPR 2021arXiv
0
citations
Uformer: A General U-Shaped Transformer for Image Restoration
CVPR 2022
0
citations
Contextual Similarity Distillation for Asymmetric Image Retrieval
CVPR 2022
0
citations
Domain-Agnostic Prior for Transfer Semantic Segmentation
CVPR 2022arXiv
0
citations
AnchorFormer: Point Cloud Completion From Discriminative Nodes
CVPR 2023
0
citations
Asymmetric Feature Fusion for Image Retrieval
CVPR 2023
0
citations
AltFreezing for More General Video Face Forgery Detection
CVPR 2023
0
citations
HandNeRF: Neural Radiance Fields for Animatable Interacting Hands
CVPR 2023arXiv
0
citations
Relation Distillation Networks for Video Object Detection
ICCV 2019
0
citations
Joint Inductive and Transductive Learning for Video Object Segmentation
ICCV 2021arXiv
0
citations
SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition
ICCV 2021arXiv
0
citations
Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation
ICCV 2021arXiv
0
citations
Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval
ICCV 2021
0
citations
TransVG: End-to-End Visual Grounding With Transformers
ICCV 2021arXiv
0
citations
Sign Language Translation with Iterative Prototype
ICCV 2023arXiv
0
citations
DIRE for Diffusion-Generated Image Detection
ICCV 2023arXiv
0
citations
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
ICCV 2023arXiv
0
citations
Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation
ICCV 2023arXiv
0
citations
Masked Motion Predictors are Strong 3D Action Representation Learners
ICCV 2023arXiv
0
citations
Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection
ICCV 2023arXiv
0
citations
Wavelet-Based Dual-Branch Network for Image Demoiréing
ECCV 2020
0
citations
CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation
ECCV 2022
0
citations
TAPE: Task-Agnostic Prior Embedding for Image Restoration
ECCV 2022
0
citations
CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds
ECCV 2022
0
citations
MVP: Multimodality-Guided Visual Pre-training
ECCV 2022
0
citations
Geometric Representation Learning for Document Image Rectification
ECCV 2022
0
citations
SOM: Semantic Obviousness Metric for Image Quality Assessment
CVPR 2015
0
citations
OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation
CVPR 2025
0
citations
Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments
ICCV 2025
0
citations
Aligning Global Semantics and Local Textures in Generative Video Enhancement
ICCV 2025
0
citations
SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
AAAI 2024
0
citations
Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation
CVPR 2024
0
citations
Contextual Similarity Aggregation with Self-attention for Visual Re-ranking
NeurIPS 2021
0
citations
LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
NeurIPS 2022
0
citations
Hand-Object Interaction Image Generation
NeurIPS 2022
0
citations
Multi-Agent First Order Constrained Optimization in Policy Space
NeurIPS 2023
0
citations
CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection
NeurIPS 2023
0
citations
Hierarchical Multi-Agent Skill Discovery
NeurIPS 2023
0
citations
State Sequences Prediction via Fourier Transform for Representation Learning
NeurIPS 2023
0
citations
DIFFER:Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning
NeurIPS 2023
0
citations