Lei Zhang
233
Papers
1,499
Total Citations
Papers (233)
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
CVPR 2024
256
citations
Dual Adversarial Network: Toward Real-world Noise Removal and Noise Generation
ECCV 2020
243
citations
Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
ECCV 2024
234
citations
Osprey: Pixel Understanding with Visual Instruction Tuning
CVPR 2024
147
citations
DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation
ICLR 2024
78
citations
ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data
AAAI 2025
72
citations
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
ICLR 2024
54
citations
Visual In-Context Prompting
CVPR 2024
52
citations
Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification
CVPR 2024
51
citations
Scaling Speech-Text Pre-training with Synthetic Interleaved Data
ICLR 2025
39
citations
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
AAAI 2025
38
citations
Open-World Human-Object Interaction Detection via Multi-modal Prompts
CVPR 2024
31
citations
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
ECCV 2024
26
citations
Adversarial Diffusion Compression for Real-World Image Super-Resolution
CVPR 2025
25
citations
Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption
CVPR 2025
16
citations
Self-Supervised Video Desmoking for Laparoscopic Surgery
ECCV 2024
15
citations
Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs
AAAI 2025
15
citations
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention
ECCV 2024
13
citations
Referring to Any Person
ICCV 2025arXiv
13
citations
Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM
CVPR 2024
12
citations
SkillMimic: Learning Basketball Interaction Skills from Demonstrations
CVPR 2025
12
citations
Neural Super-Resolution for Real-time Rendering with Radiance Demodulation
CVPR 2024
9
citations
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution
ICCV 2025
9
citations
Symbol as Points: Panoptic Symbol Spotting via Point-based Representation
ICLR 2024
9
citations
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning
AAAI 2025
7
citations
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation
CVPR 2025
6
citations
HandOS: 3D Hand Reconstruction in One Stage
CVPR 2025arXiv
5
citations
Integrating Visual Interpretation and Linguistic Reasoning for Geometric Problem Solving
ICCV 2025
3
citations
HumanMM: Global Human Motion Recovery from Multi-shot Videos
CVPR 2025
3
citations
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing
AAAI 2025
2
citations
Reverse Convolution and Its Applications to Image Restoration
ICCV 2025arXiv
1
citations
PASS: Path-selective State Space Model for Event-based Recognition
NeurIPS 2025
1
citations
The Underappreciated Power of Vision Models for Graph Structural Understanding
NeurIPS 2025
1
citations
Multi-Edge Reinforced Collaborative Data Acquisition for Continuous Video Analytics by Prioritizing Quality over Quantity
AAAI 2025
1
citations
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment
CVPR 2024
0
citations
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
CVPR 2024
0
citations
Efficient Scene Recovery Using Luminous Flux Prior
CVPR 2024
0
citations
Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer
CVPR 2024
0
citations
State-Constrained Zero-Sum Differential Games with One-Sided Information
ICML 2024
0
citations
DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation
ICML 2024
0
citations
HumanTOMATO: Text-aligned Whole-body Motion Generation
ICML 2024
0
citations
Reweighted Laplace Prior Based Hyperspectral Compressive Sensing for Unknown Sparsity
CVPR 2015
0
citations
Discriminative Learning of Iteration-Wise Priors for Blind Deconvolution
CVPR 2015
0
citations
Joint Learning of Single-Image and Cross-Image Representations for Person Re-Identification
CVPR 2016
0
citations
Group MAD Competition - A New Methodology to Compare Objective Image Quality Models
CVPR 2016
0
citations
Multispectral Images Denoising by Intrinsic Tensor Sparsity Regularization
CVPR 2016
0
citations
Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection
CVPR 2016
0
citations
A Probabilistic Collaborative Representation Based Approach for Pattern Classification
CVPR 2016
0
citations
Object Tracking via Dual Linear Structured SVM and Explicit Feature Map
CVPR 2016
0
citations
RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian With Application to Material Recognition
CVPR 2016
0
citations
G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition
CVPR 2017
0
citations
Learning Dynamic Guidance for Depth Image Enhancement
CVPR 2017
0
citations
Learning Deep CNN Denoiser Prior for Image Restoration
CVPR 2017arXiv
0
citations
Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally
CVPR 2017
0
citations
Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection
CVPR 2018arXiv
0
citations
Learning a Single Convolutional Super-Resolution Network for Multiple Degradations
CVPR 2018arXiv
0
citations
A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping
CVPR 2018
0
citations
Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking
CVPR 2018arXiv
0
citations
CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise
CVPR 2018arXiv
0
citations
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
CVPR 2018arXiv
0
citations
A PID Controller Approach for Stochastic Optimization of Deep Networks
CVPR 2018
0
citations
Deep Plug-And-Play Super-Resolution for Arbitrary Blur Kernels
CVPR 2019
0
citations
Toward Convolutional Blind Denoising of Real Photographs
CVPR 2019
0
citations
Reliable and Efficient Image Cropping: A Grid Anchor Based Approach
CVPR 2019
0
citations
FOCNet: A Fractional Optimal Control Network for Image Denoising
CVPR 2019
0
citations
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
CVPR 2019
0
citations
Variational Bayesian Dropout With a Hierarchical Prior
CVPR 2019
0
citations
Second-Order Attention Network for Single Image Super-Resolution
CVPR 2019
0
citations
Object-Driven Text-To-Image Synthesis via Adversarial Training
CVPR 2019
0
citations
Multi-Domain Learning for Accurate and Few-Shot Color Constancy
CVPR 2020
0
citations
Unsupervised Adaptation Learning for Hyperspectral Imagery Super-Resolution
CVPR 2020
0
citations
CPR-GCN: Conditional Partial-Residual Graph Convolutional Network in Automated Anatomical Labeling of Coronary Arteries
CVPR 2020
0
citations
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
CVPR 2020arXiv
0
citations
Probability Weighted Compact Feature for Domain Adaptive Retrieval
CVPR 2020arXiv
0
citations
Structure Aware Single-Stage 3D Object Detection From Point Cloud
CVPR 2020
0
citations
VirFace: Enhancing Face Recognition via Unlabeled Shallow Data
CVPR 2021
0
citations
Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification
CVPR 2021arXiv
0
citations
VinVL: Revisiting Visual Representations in Vision-Language Models
CVPR 2021arXiv
0
citations
Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation
CVPR 2021arXiv
0
citations
PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency
CVPR 2021arXiv
0
citations
Unsupervised Part Segmentation Through Disentangling Appearance and Shape
CVPR 2021arXiv
0
citations
Progressive Semantic-Aware Style Transformation for Blind Face Restoration
CVPR 2021arXiv
0
citations
Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection
CVPR 2021
0
citations
Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset With Limited Computational Resources
CVPR 2021
0
citations
Unsupervised Pre-Training for Person Re-Identification
CVPR 2021arXiv
0
citations
Learning Parallel Dense Correspondence From Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction
CVPR 2021arXiv
0
citations
GAN Prior Embedded Network for Blind Face Restoration in the Wild
CVPR 2021arXiv
0
citations
Dynamic Weighted Learning for Unsupervised Domain Adaptation
CVPR 2021arXiv
0
citations
Dynamic Head: Unifying Object Detection Heads With Attentions
CVPR 2021arXiv
0
citations
Learning Tensor Low-Rank Prior for Hyperspectral Image Reconstruction
CVPR 2021
0
citations
Deep Convolutional Dictionary Learning for Image Denoising
CVPR 2021
0
citations
TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption
CVPR 2021arXiv
0
citations
High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network
CVPR 2021
0
citations
Lite-HRNet: A Lightweight High-Resolution Network
CVPR 2021
0
citations
DAP: Detection-Aware Pre-Training With Weak Supervision
CVPR 2021arXiv
0
citations
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection From Point Clouds
CVPR 2022arXiv
0
citations
Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization
CVPR 2022arXiv
0
citations
Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution
CVPR 2022arXiv
0
citations
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
CVPR 2022
0
citations
Dense Learning Based Semi-Supervised Object Detection
CVPR 2022arXiv
0
citations
Quantization-Aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging
CVPR 2022
0
citations
Grounded Language-Image Pre-Training
CVPR 2022arXiv
0
citations
Blind Image Super-Resolution With Elaborate Degradation Modeling on Noise and Kernel
CVPR 2022arXiv
0
citations
Large-Scale Pre-Training for Person Re-Identification With Noisy Labels
CVPR 2022arXiv
0
citations
Towards Efficient Data Free Black-Box Adversarial Attack
CVPR 2022
0
citations
A Differentiable Two-Stage Alignment Scheme for Burst Image Reconstruction With Large Shift
CVPR 2022arXiv
0
citations
Neural Architecture Search With Representation Mutual Information
CVPR 2022
0
citations
A Dual Weighting Label Assignment Scheme for Object Detection
CVPR 2022arXiv
0
citations
A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-Resolution
CVPR 2022arXiv
0
citations
Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation
CVPR 2022arXiv
0
citations
DynaMask: Dynamic Mask Selection for Instance Segmentation
CVPR 2023arXiv
0
citations
Revisiting Prototypical Network for Cross Domain Few-Shot Learning
CVPR 2023
0
citations
A General Regret Bound of Preconditioned Gradient Method for DNN Training
CVPR 2023
0
citations
OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering
CVPR 2023arXiv
0
citations
Glocal Energy-Based Learning for Few-Shot Open-Set Recognition
CVPR 2023arXiv
0
citations
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
CVPR 2023
0
citations
SIM: Semantic-Aware Instance Mask Generation for Box-Supervised Instance Segmentation
CVPR 2023arXiv
0
citations
Accelerating Dataset Distillation via Model Augmentation
CVPR 2023arXiv
0
citations
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
CVPR 2023
0
citations
MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences
CVPR 2023arXiv
0
citations
MDQE: Mining Discriminative Query Embeddings To Segment Occluded Instances on Challenging Videos
CVPR 2023arXiv
0
citations
Sharpness-Aware Gradient Matching for Domain Generalization
CVPR 2023arXiv
0
citations
One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer
CVPR 2023arXiv
0
citations
Human Guided Ground-Truth Generation for Realistic Image Super-Resolution
CVPR 2023arXiv
0
citations
Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation
CVPR 2023arXiv
0
citations
Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis
CVPR 2023
0
citations
Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset
CVPR 2023
0
citations
MP-Former: Mask-Piloted Transformer for Image Segmentation
CVPR 2023
0
citations
One-to-Few Label Assignment for End-to-End Dense Detection
CVPR 2023arXiv
0
citations
Multi-View Adversarial Discriminator: Mine the Non-Causal Factors for Object Detection in Unseen Domains
CVPR 2023arXiv
0
citations
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR
CVPR 2023arXiv
0
citations
Patch Group Based Nonlocal Self-Similarity Prior Learning for Image Denoising
ICCV 2015
0
citations
External Patch Prior Guided Internal Clustering for Image Denoising
ICCV 2015
0
citations
Convolutional Sparse Coding for Image Super-Resolution
ICCV 2015
0
citations
Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior
ICCV 2015
0
citations
Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization
ICCV 2017
0
citations
When Unsupervised Domain Adaptation Meets Tensor Representations
ICCV 2017arXiv
0
citations
Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising
ICCV 2017arXiv
0
citations
Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation
ICCV 2017
0
citations
3D Surface Detail Enhancement From a Single Normal Map
ICCV 2017
0
citations
Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model
ICCV 2019
0
citations
Dynamic Anchor Feature Selection for Single-Shot Object Detection
ICCV 2019
0
citations
Multi-Adversarial Faster-RCNN for Unrestricted Object Detection
ICCV 2019
0
citations
WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection
ICCV 2019
0
citations
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
ICCV 2021arXiv
0
citations
Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting
ICCV 2021arXiv
0
citations
SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks
ICCV 2021
0
citations
Dynamic DETR: End-to-End Object Detection With Dynamic Attention
ICCV 2021
0
citations
CvT: Introducing Convolutions to Vision Transformers
ICCV 2021arXiv
0
citations
Real-World Video Super-Resolution: A Benchmark Dataset and a Decomposition Based Learning Scheme
ICCV 2021
0
citations
Reconcile Prediction Consistency for Balanced Object Detection
ICCV 2021arXiv
0
citations
HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset
ICCV 2021arXiv
0
citations
MicroNet: Improving Image Recognition With Extremely Low FLOPs
ICCV 2021arXiv
0
citations
Improve Unsupervised Pretraining for Few-Label Transfer
ICCV 2021arXiv
0
citations
A Benchmark for Chinese-English Scene Text Image Super-Resolution
ICCV 2023arXiv
0
citations
CORE: Cooperative Reconstruction for Multi-Agent Perception
ICCV 2023arXiv
0
citations
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport
ICCV 2023arXiv
0
citations
Towards Fairness-aware Adversarial Network Pruning
ICCV 2023
0
citations
A Simple Framework for Open-Vocabulary Segmentation and Detection
ICCV 2023arXiv
0
citations
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation
ICCV 2023
0
citations
DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting
ICCV 2023arXiv
0
citations
RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning
ICCV 2023
0
citations
Generative Action Description Prompts for Skeleton-based Action Recognition
ICCV 2023arXiv
0
citations
Detection Transformer with Stable Matching
ICCV 2023arXiv
0
citations
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
ICCV 2023arXiv
0
citations
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation
ICCV 2023arXiv
0
citations
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
ICCV 2023arXiv
0
citations
Neural Interactive Keypoint Detection
ICCV 2023arXiv
0
citations
Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle
ICCV 2023
0
citations
Gradient Centralization: A New Optimization Technique for Deep Neural Networks
ECCV 2020
0
citations
Suppress and Balance: A Simple Gated Network for Salient Object Detection
ECCV 2020
0
citations
Label Propagation with Augmented Anchors: A Simple Semi-Supervised Learning baseline for Unsupervised Domain Adaptation
ECCV 2020
0
citations
Blind Face Restoration via Deep Multi-scale Component Dictionaries
ECCV 2020
0
citations
LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform
ECCV 2020
0
citations
Momentum Batch Normalization for Deep Learning with Small Batch Size
ECCV 2020
0
citations
A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection
ECCV 2020
0
citations
Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN
ECCV 2020
0
citations
A Decoupled Learning Scheme for Real-world Burst Denoising from Raw Images
ECCV 2020
0
citations
Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
ECCV 2020
0
citations
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
ECCV 2020
0
citations
Spatiotemporal Self-Attention Modeling with Temporal Patch Shift for Action Recognition
ECCV 2022
0
citations
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval
ECCV 2022
0
citations
Efficient Long-Range Attention Network for Image Super-Resolution
ECCV 2022
0
citations
From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution
ECCV 2022
0
citations
Unfolded Deep Kernel Estimation for Blind Image Super-Resolution
ECCV 2022
0
citations
Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution
ECCV 2022
0
citations
An Embedded Feature Whitening Approach to Deep Neural Network Optimization
ECCV 2022
0
citations
Box-Supervised Instance Segmentation with Level Set Evolution
ECCV 2022
0
citations
Attention Diversification for Domain Generalization
ECCV 2022
0
citations
View Confusion Feature Learning for Person Re-Identification
ICCV 2019
0
citations
Low-Biased General Annotated Dataset Generation
CVPR 2025
0
citations
RORem: Training a Robust Object Remover with Human-in-the-Loop
CVPR 2025
0
citations
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
CVPR 2025
0
citations
MaSS13K: A Matting-level Semantic Segmentation Benchmark
CVPR 2025
0
citations
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
CVPR 2025
0
citations
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians
CVPR 2025
0
citations
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction
CVPR 2025
0
citations
FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation
CVPR 2025
0
citations
Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation
ICCV 2025
0
citations
FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
ICCV 2025
0
citations
Co-Painter: Fine-Grained Controllable Image Stylization via Implicit Decoupling and Adaptive Injection
ICCV 2025
0
citations
UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images
ICCV 2025
0
citations
ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection
ICCV 2025
0
citations
Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training
ICCV 2025
0
citations
Towards Effective Foundation Model Adaptation for Extreme Cross-Domain Few-Shot Learning
ICCV 2025
0
citations
Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval
ICCV 2025
0
citations
Dual-Temporal Exemplar Representation Network for Video Semantic Segmentation
ICCV 2025
0
citations
InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction
ICCV 2025
0
citations
Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
ICCV 2025
0
citations
Polyline Path Masked Attention for Vision Transformer
NeurIPS 2025
0
citations
SLRL: Semi-Supervised Local Community Detection Based on Reinforcement Learning
AAAI 2025
0
citations
CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization
AAAI 2025
0
citations
GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution
AAAI 2025
0
citations
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence
AAAI 2025
0
citations
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
AAAI 2025
0
citations
GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation
AAAI 2025
0
citations
Adversarial Contrastive Graph Augmentation with Counterfactual Regularization
AAAI 2025
0
citations
Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
AAAI 2025
0
citations
Fine-Tuning Language Models with Collaborative and Semantic Experts
AAAI 2025
0
citations
Dynamic Weighted Combiner for Mixed-Modal Image Retrieval
AAAI 2024
0
citations
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching
AAAI 2024
0
citations
Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing
AAAI 2024
0
citations
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
CVPR 2024
0
citations
Turbo Learning for CaptionBot and DrawingBot
NeurIPS 2018
0
citations
Variational Denoising Network: Toward Blind Noise Modeling and Removal
NeurIPS 2019
0
citations
Chasing Sparsity in Vision Transformers: An End-to-End Exploration
NeurIPS 2021
0
citations
DreamWaltz: Make a Scene with Complex 3D Animatable Avatars
NeurIPS 2023
0
citations
SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation
NeurIPS 2023
0
citations
Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset
NeurIPS 2023
0
citations
Semi-Supervised Domain Generalization with Known and Unknown Classes
NeurIPS 2023
0
citations
Label-efficient Segmentation via Affinity Propagation
NeurIPS 2023
0
citations
A Comprehensive Benchmark for Neural Human Radiance Fields
NeurIPS 2023
0
citations
MomentDiff: Generative Video Moment Retrieval from Random to Real
NeurIPS 2023
0
citations