56 Papers
1,331 Total Citations

Papers (56)

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

Deep Supervised Discrete Hashing

NeurIPS 2017 (arXiv)
324
citations

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

ICLR 2024
110
citations

Breaking the Low-Rank Dilemma of Linear Attention

CVPR 2025
15
citations

R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning

CVPR 2025
13
citations

Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

NeurIPS 2025
7
citations

Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens

ICCV 2025
3
citations

DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling

NeurIPS 2025
1
citations

RMT: Retentive Networks Meet Vision Transformers

CVPR 2024
0
citations

Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer

CVPR 2024
0
citations

Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration

CVPR 2024
0
citations

Backdoor Defense via Test-Time Detecting and Repairing

CVPR 2024
0
citations

Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization

ICML 2024
0
citations

Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models

ICML 2024
0
citations

Pose-Guided Photorealistic Face Rotation

CVPR 2018
0
citations

Distant Supervised Centroid Shift: A Simple and Efficient Approach to Visual Domain Adaptation

CVPR 2019
0
citations

Cross-Spectral Face Hallucination via Disentangling Independent Factors

CVPR 2020 (arXiv)
0
citations

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer

CVPR 2020 (arXiv)
0
citations

GP-NAS: Gaussian Process Based Neural Architecture Search

CVPR 2020
0
citations

Information Bottleneck Disentanglement for Identity Swapping

CVPR 2021
0
citations

ReMix: Towards Image-to-Image Translation With Limited Data

CVPR 2021 (arXiv)
0
citations

Pareidolia Face Reenactment

CVPR 2021 (arXiv)
0
citations

Memory Oriented Transfer Learning for Semi-Supervised Image Deraining

CVPR 2021
0
citations

FaceInpainter: High Fidelity Face Adaptation to Heterogeneous Domains

CVPR 2021
0
citations

DINE: Domain Adaptation From Single and Multiple Black-Box Predictors

CVPR 2022 (arXiv)
0
citations

Improving Subgraph Recognition With Variational Graph Information Bottleneck

CVPR 2022 (arXiv)
0
citations

Rethinking Image Cropping: Exploring Diverse Compositions From Global Views

CVPR 2022
0
citations

Few-Shot Backdoor Defense Using Shapley Estimation

CVPR 2022 (arXiv)
0
citations

Mind the Label Shift of Augmentation-Based Graph OOD Generalization

CVPR 2023 (arXiv)
0
citations

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

ICCV 2017 (arXiv)
0
citations

Make a Face: Towards Arbitrary High Fidelity Face Manipulation

ICCV 2019
0
citations

M2FPA: A Multi-Yaw Multi-Pitch High-Quality Dataset and Benchmark for Facial Pose Analysis

ICCV 2019
0
citations

Invisible Backdoor Attack With Sample-Specific Triggers

ICCV 2021 (arXiv)
0
citations

CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification

ICCV 2021
0
citations

TALL: Thumbnail Layout for Deepfake Video Detection

ICCV 2023 (arXiv)
0
citations

Pluralistic Aging Diffusion Autoencoder

ICCV 2023 (arXiv)
0
citations

Hierarchical Face Aging through Disentangled Latent Characteristics

ECCV 2020
0
citations

A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation

ECCV 2020
0
citations

TF-NAS: Rethinking Three Search Freedoms of Latency-Constrained Differentiable Neural Architecture Search

ECCV 2020
0
citations

Informative Sample Mining Network for Multi-Domain Image-to-Image Translation

ECCV 2020
0
citations

Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution

ICCV 2017
0
citations

Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?

CVPR 2025
0
citations

Cooperative Pseudo Labeling for Unsupervised Federated Classification

ICCV 2025
0
citations

Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification

ICCV 2025
0
citations

Rectifying Magnitude Neglect in Linear Attention

ICCV 2025
0
citations

Exploring Vacant Classes in Label-Skewed Federated Learning

AAAI 2025
0
citations

Protecting Model Adaptation from Trojans in the Unlabeled Data

AAAI 2025
0
citations

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification

AAAI 2024
0
citations

IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis

NeurIPS 2018
0
citations

Learning a High Fidelity Pose Invariant Model for High-resolution Face Frontalization

NeurIPS 2018
0
citations

Dual Variational Generation for Low Shot Heterogeneous Face Recognition

NeurIPS 2019
0
citations

AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection

NeurIPS 2020
0
citations

Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization

NeurIPS 2022
0
citations

Are You Stealing My Model? Sample Correlation for Fingerprinting Deep Neural Networks

NeurIPS 2022
0
citations

Lightweight Vision Transformer with Bidirectional Interaction

NeurIPS 2023
0
citations

Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification

NeurIPS 2023
0
citations