Lei Zhang

233
Papers
1,499
Total Citations

Papers (233)

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

CVPR 2024
256
citations

Dual Adversarial Network: Toward Real-world Noise Removal and Noise Generation

ECCV 2020
243
citations

Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization

ECCV 2024
234
citations

Osprey: Pixel Understanding with Visual Instruction Tuning

CVPR 2024
147
citations

DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation

ICLR 2024
78
citations

ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data

AAAI 2025
72
citations

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

ICLR 2024
54
citations

Visual In-Context Prompting

CVPR 2024
52
citations

Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification

CVPR 2024
51
citations

Scaling Speech-Text Pre-training with Synthetic Interleaved Data

ICLR 2025
39
citations

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility

AAAI 2025
38
citations

Open-World Human-Object Interaction Detection via Multi-modal Prompts

CVPR 2024
31
citations

ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation

ECCV 2024
26
citations

Adversarial Diffusion Compression for Real-World Image Super-Resolution

CVPR 2025
25
citations

Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption

CVPR 2025
16
citations

Self-Supervised Video Desmoking for Laparoscopic Surgery

ECCV 2024
15
citations

Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs

AAAI 2025
15
citations

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

ECCV 2024
13
citations

Referring to Any Person

ICCV 2025arXiv
13
citations

Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM

CVPR 2024
12
citations

SkillMimic: Learning Basketball Interaction Skills from Demonstrations

CVPR 2025
12
citations

Neural Super-Resolution for Real-time Rendering with Radiance Demodulation

CVPR 2024
9
citations

Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution

ICCV 2025
9
citations

Symbol as Points: Panoptic Symbol Spotting via Point-based Representation

ICLR 2024
9
citations

Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning

AAAI 2025
7
citations

D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation

CVPR 2025
6
citations

HandOS: 3D Hand Reconstruction in One Stage

CVPR 2025arXiv
5
citations

Integrating Visual Interpretation and Linguistic Reasoning for Geometric Problem Solving

ICCV 2025
3
citations

HumanMM: Global Human Motion Recovery from Multi-shot Videos

CVPR 2025
3
citations

SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing

AAAI 2025
2
citations

Reverse Convolution and Its Applications to Image Restoration

ICCV 2025arXiv
1
citations

PASS: Path-selective State Space Model for Event-based Recognition

NeurIPS 2025
1
citations

The Underappreciated Power of Vision Models for Graph Structural Understanding

NeurIPS 2025
1
citations

Multi-Edge Reinforced Collaborative Data Acquisition for Continuous Video Analytics by Prioritizing Quality over Quantity

AAAI 2025
1
citations

Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment

CVPR 2024
0
citations

UniVS: Unified and Universal Video Segmentation with Prompts as Queries

CVPR 2024
0
citations

Efficient Scene Recovery Using Luminous Flux Prior

CVPR 2024
0
citations

Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer

CVPR 2024
0
citations

State-Constrained Zero-Sum Differential Games with One-Sided Information

ICML 2024
0
citations

DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation

ICML 2024
0
citations

HumanTOMATO: Text-aligned Whole-body Motion Generation

ICML 2024
0
citations

Reweighted Laplace Prior Based Hyperspectral Compressive Sensing for Unknown Sparsity

CVPR 2015
0
citations

Discriminative Learning of Iteration-Wise Priors for Blind Deconvolution

CVPR 2015
0
citations

Joint Learning of Single-Image and Cross-Image Representations for Person Re-Identification

CVPR 2016
0
citations

Group MAD Competition - A New Methodology to Compare Objective Image Quality Models

CVPR 2016
0
citations

Multispectral Images Denoising by Intrinsic Tensor Sparsity Regularization

CVPR 2016
0
citations

Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection

CVPR 2016
0
citations

A Probabilistic Collaborative Representation Based Approach for Pattern Classification

CVPR 2016
0
citations

Object Tracking via Dual Linear Structured SVM and Explicit Feature Map

CVPR 2016
0
citations

RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian With Application to Material Recognition

CVPR 2016
0
citations

G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition

CVPR 2017
0
citations

Learning Dynamic Guidance for Depth Image Enhancement

CVPR 2017
0
citations

Learning Deep CNN Denoiser Prior for Image Restoration

CVPR 2017arXiv
0
citations

Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally

CVPR 2017
0
citations

Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection

CVPR 2018arXiv
0
citations

Learning a Single Convolutional Super-Resolution Network for Multiple Degradations

CVPR 2018arXiv
0
citations

A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping

CVPR 2018
0
citations

Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking

CVPR 2018arXiv
0
citations

CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise

CVPR 2018arXiv
0
citations

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

CVPR 2018arXiv
0
citations

A PID Controller Approach for Stochastic Optimization of Deep Networks

CVPR 2018
0
citations

Deep Plug-And-Play Super-Resolution for Arbitrary Blur Kernels

CVPR 2019
0
citations

Toward Convolutional Blind Denoising of Real Photographs

CVPR 2019
0
citations

Reliable and Efficient Image Cropping: A Grid Anchor Based Approach

CVPR 2019
0
citations

FOCNet: A Fractional Optimal Control Network for Image Denoising

CVPR 2019
0
citations

Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

CVPR 2019
0
citations

Variational Bayesian Dropout With a Hierarchical Prior

CVPR 2019
0
citations

Second-Order Attention Network for Single Image Super-Resolution

CVPR 2019
0
citations

Object-Driven Text-To-Image Synthesis via Adversarial Training

CVPR 2019
0
citations

Multi-Domain Learning for Accurate and Few-Shot Color Constancy

CVPR 2020
0
citations

Unsupervised Adaptation Learning for Hyperspectral Imagery Super-Resolution

CVPR 2020
0
citations

CPR-GCN: Conditional Partial-Residual Graph Convolutional Network in Automated Anatomical Labeling of Coronary Arteries

CVPR 2020
0
citations

HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation

CVPR 2020arXiv
0
citations

Probability Weighted Compact Feature for Domain Adaptive Retrieval

CVPR 2020arXiv
0
citations

Structure Aware Single-Stage 3D Object Detection From Point Cloud

CVPR 2020
0
citations

VirFace: Enhancing Face Recognition via Unlabeled Shallow Data

CVPR 2021
0
citations

Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification

CVPR 2021arXiv
0
citations

VinVL: Revisiting Visual Representations in Vision-Language Models

CVPR 2021arXiv
0
citations

Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation

CVPR 2021arXiv
0
citations

PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency

CVPR 2021arXiv
0
citations

Unsupervised Part Segmentation Through Disentangling Appearance and Shape

CVPR 2021arXiv
0
citations

Progressive Semantic-Aware Style Transformation for Blind Face Restoration

CVPR 2021arXiv
0
citations

Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection

CVPR 2021
0
citations

Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset With Limited Computational Resources

CVPR 2021
0
citations

Unsupervised Pre-Training for Person Re-Identification

CVPR 2021arXiv
0
citations

Learning Parallel Dense Correspondence From Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction

CVPR 2021arXiv
0
citations

GAN Prior Embedded Network for Blind Face Restoration in the Wild

CVPR 2021arXiv
0
citations

Dynamic Weighted Learning for Unsupervised Domain Adaptation

CVPR 2021arXiv
0
citations

Dynamic Head: Unifying Object Detection Heads With Attentions

CVPR 2021arXiv
0
citations

Learning Tensor Low-Rank Prior for Hyperspectral Image Reconstruction

CVPR 2021
0
citations

Deep Convolutional Dictionary Learning for Image Denoising

CVPR 2021
0
citations

TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption

CVPR 2021arXiv
0
citations

High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network

CVPR 2021
0
citations

Lite-HRNet: A Lightweight High-Resolution Network

CVPR 2021
0
citations

DAP: Detection-Aware Pre-Training With Weak Supervision

CVPR 2021arXiv
0
citations

Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection From Point Clouds

CVPR 2022arXiv
0
citations

Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization

CVPR 2022arXiv
0
citations

Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution

CVPR 2022arXiv
0
citations

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

CVPR 2022
0
citations

Dense Learning Based Semi-Supervised Object Detection

CVPR 2022arXiv
0
citations

Quantization-Aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging

CVPR 2022
0
citations

Grounded Language-Image Pre-Training

CVPR 2022arXiv
0
citations

Blind Image Super-Resolution With Elaborate Degradation Modeling on Noise and Kernel

CVPR 2022arXiv
0
citations

Large-Scale Pre-Training for Person Re-Identification With Noisy Labels

CVPR 2022arXiv
0
citations

Towards Efficient Data Free Black-Box Adversarial Attack

CVPR 2022
0
citations

A Differentiable Two-Stage Alignment Scheme for Burst Image Reconstruction With Large Shift

CVPR 2022arXiv
0
citations

Neural Architecture Search With Representation Mutual Information

CVPR 2022
0
citations

A Dual Weighting Label Assignment Scheme for Object Detection

CVPR 2022arXiv
0
citations

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-Resolution

CVPR 2022arXiv
0
citations

Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation

CVPR 2022arXiv
0
citations

DynaMask: Dynamic Mask Selection for Instance Segmentation

CVPR 2023arXiv
0
citations

Revisiting Prototypical Network for Cross Domain Few-Shot Learning

CVPR 2023
0
citations

A General Regret Bound of Preconditioned Gradient Method for DNN Training

CVPR 2023
0
citations

OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering

CVPR 2023arXiv
0
citations

Glocal Energy-Based Learning for Few-Shot Open-Set Recognition

CVPR 2023arXiv
0
citations

DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training

CVPR 2023
0
citations

SIM: Semantic-Aware Instance Mask Generation for Box-Supervised Instance Segmentation

CVPR 2023arXiv
0
citations

Accelerating Dataset Distillation via Model Augmentation

CVPR 2023arXiv
0
citations

Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes

CVPR 2023
0
citations

MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences

CVPR 2023arXiv
0
citations

MDQE: Mining Discriminative Query Embeddings To Segment Occluded Instances on Challenging Videos

CVPR 2023arXiv
0
citations

Sharpness-Aware Gradient Matching for Domain Generalization

CVPR 2023arXiv
0
citations

One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer

CVPR 2023arXiv
0
citations

Human Guided Ground-Truth Generation for Realistic Image Super-Resolution

CVPR 2023arXiv
0
citations

Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation

CVPR 2023arXiv
0
citations

Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis

CVPR 2023
0
citations

Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset

CVPR 2023
0
citations

MP-Former: Mask-Piloted Transformer for Image Segmentation

CVPR 2023
0
citations

One-to-Few Label Assignment for End-to-End Dense Detection

CVPR 2023arXiv
0
citations

Multi-View Adversarial Discriminator: Mine the Non-Causal Factors for Object Detection in Unseen Domains

CVPR 2023arXiv
0
citations

Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR

CVPR 2023arXiv
0
citations

Patch Group Based Nonlocal Self-Similarity Prior Learning for Image Denoising

ICCV 2015
0
citations

External Patch Prior Guided Internal Clustering for Image Denoising

ICCV 2015
0
citations

Convolutional Sparse Coding for Image Super-Resolution

ICCV 2015
0
citations

Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior

ICCV 2015
0
citations

Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization

ICCV 2017
0
citations

When Unsupervised Domain Adaptation Meets Tensor Representations

ICCV 2017arXiv
0
citations

Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising

ICCV 2017arXiv
0
citations

Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation

ICCV 2017
0
citations

3D Surface Detail Enhancement From a Single Normal Map

ICCV 2017
0
citations

Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model

ICCV 2019
0
citations

Dynamic Anchor Feature Selection for Single-Shot Object Detection

ICCV 2019
0
citations

Multi-Adversarial Faster-RCNN for Unrestricted Object Detection

ICCV 2019
0
citations

WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection

ICCV 2019
0
citations

Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding

ICCV 2021arXiv
0
citations

Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting

ICCV 2021arXiv
0
citations

SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks

ICCV 2021
0
citations

Dynamic DETR: End-to-End Object Detection With Dynamic Attention

ICCV 2021
0
citations

CvT: Introducing Convolutions to Vision Transformers

ICCV 2021arXiv
0
citations

Real-World Video Super-Resolution: A Benchmark Dataset and a Decomposition Based Learning Scheme

ICCV 2021
0
citations

Reconcile Prediction Consistency for Balanced Object Detection

ICCV 2021arXiv
0
citations

HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset

ICCV 2021arXiv
0
citations

MicroNet: Improving Image Recognition With Extremely Low FLOPs

ICCV 2021arXiv
0
citations

Improve Unsupervised Pretraining for Few-Label Transfer

ICCV 2021arXiv
0
citations

A Benchmark for Chinese-English Scene Text Image Super-Resolution

ICCV 2023arXiv
0
citations

CORE: Cooperative Reconstruction for Multi-Agent Perception

ICCV 2023arXiv
0
citations

Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport

ICCV 2023arXiv
0
citations

Towards Fairness-aware Adversarial Network Pruning

ICCV 2023
0
citations

A Simple Framework for Open-Vocabulary Segmentation and Detection

ICCV 2023arXiv
0
citations

FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation

ICCV 2023
0
citations

DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting

ICCV 2023arXiv
0
citations

RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning

ICCV 2023
0
citations

Generative Action Description Prompts for Skeleton-based Action Recognition

ICCV 2023arXiv
0
citations

Detection Transformer with Stable Matching

ICCV 2023arXiv
0
citations

HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation

ICCV 2023arXiv
0
citations

Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation

ICCV 2023arXiv
0
citations

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

ICCV 2023arXiv
0
citations

Neural Interactive Keypoint Detection

ICCV 2023arXiv
0
citations

Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle

ICCV 2023
0
citations

Gradient Centralization: A New Optimization Technique for Deep Neural Networks

ECCV 2020
0
citations

Suppress and Balance: A Simple Gated Network for Salient Object Detection

ECCV 2020
0
citations

Label Propagation with Augmented Anchors: A Simple Semi-Supervised Learning baseline for Unsupervised Domain Adaptation

ECCV 2020
0
citations

Blind Face Restoration via Deep Multi-scale Component Dictionaries

ECCV 2020
0
citations

LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform

ECCV 2020
0
citations

Momentum Batch Normalization for Deep Learning with Small Batch Size

ECCV 2020
0
citations

A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection

ECCV 2020
0
citations

Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN

ECCV 2020
0
citations

A Decoupled Learning Scheme for Real-world Burst Denoising from Raw Images

ECCV 2020
0
citations

Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer

ECCV 2020
0
citations

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

ECCV 2020
0
citations

Spatiotemporal Self-Attention Modeling with Temporal Patch Shift for Action Recognition

ECCV 2022
0
citations

Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval

ECCV 2022
0
citations

Efficient Long-Range Attention Network for Image Super-Resolution

ECCV 2022
0
citations

From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution

ECCV 2022
0
citations

Unfolded Deep Kernel Estimation for Blind Image Super-Resolution

ECCV 2022
0
citations

Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution

ECCV 2022
0
citations

An Embedded Feature Whitening Approach to Deep Neural Network Optimization

ECCV 2022
0
citations

Box-Supervised Instance Segmentation with Level Set Evolution

ECCV 2022
0
citations

Attention Diversification for Domain Generalization

ECCV 2022
0
citations

View Confusion Feature Learning for Person Re-Identification

ICCV 2019
0
citations

Low-Biased General Annotated Dataset Generation

CVPR 2025
0
citations

RORem: Training a Robust Object Remover with Human-in-the-Loop

CVPR 2025
0
citations

Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach

CVPR 2025
0
citations

MaSS13K: A Matting-level Semantic Segmentation Benchmark

CVPR 2025
0
citations

Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data

CVPR 2025
0
citations

LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians

CVPR 2025
0
citations

OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction

CVPR 2025
0
citations

FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation

CVPR 2025
0
citations

Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation

ICCV 2025
0
citations

FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models

ICCV 2025
0
citations

Co-Painter: Fine-Grained Controllable Image Stylization via Implicit Decoupling and Adaptive Injection

ICCV 2025
0
citations

UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images

ICCV 2025
0
citations

ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection

ICCV 2025
0
citations

Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training

ICCV 2025
0
citations

Towards Effective Foundation Model Adaptation for Extreme Cross-Domain Few-Shot Learning

ICCV 2025
0
citations

Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval

ICCV 2025
0
citations

Dual-Temporal Exemplar Representation Network for Video Semantic Segmentation

ICCV 2025
0
citations

InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction

ICCV 2025
0
citations

Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models

ICCV 2025
0
citations

Polyline Path Masked Attention for Vision Transformer

NeurIPS 2025
0
citations

SLRL: Semi-Supervised Local Community Detection Based on Reinforcement Learning

AAAI 2025
0
citations

CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization

AAAI 2025
0
citations

GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution

AAAI 2025
0
citations

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence

AAAI 2025
0
citations

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

AAAI 2025
0
citations

GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation

AAAI 2025
0
citations

Adversarial Contrastive Graph Augmentation with Counterfactual Regularization

AAAI 2025
0
citations

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection

AAAI 2025
0
citations

Fine-Tuning Language Models with Collaborative and Semantic Experts

AAAI 2025
0
citations

Dynamic Weighted Combiner for Mixed-Modal Image Retrieval

AAAI 2024
0
citations

Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching

AAAI 2024
0
citations

Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing

AAAI 2024
0
citations

Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models

CVPR 2024
0
citations

Turbo Learning for CaptionBot and DrawingBot

NeurIPS 2018
0
citations

Variational Denoising Network: Toward Blind Noise Modeling and Removal

NeurIPS 2019
0
citations

Chasing Sparsity in Vision Transformers: An End-to-End Exploration

NeurIPS 2021
0
citations

DreamWaltz: Make a Scene with Complex 3D Animatable Avatars

NeurIPS 2023
0
citations

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation

NeurIPS 2023
0
citations

Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset

NeurIPS 2023
0
citations

Semi-Supervised Domain Generalization with Known and Unknown Classes

NeurIPS 2023
0
citations

Label-efficient Segmentation via Affinity Propagation

NeurIPS 2023
0
citations

A Comprehensive Benchmark for Neural Human Radiance Fields

NeurIPS 2023
0
citations

MomentDiff: Generative Video Moment Retrieval from Random to Real

NeurIPS 2023
0
citations