Shiguang Shan

74
Papers
177
Total Citations

Papers (74)

Autoregressive Video Generation without Vector Quantization

ICLR 2025arXiv
101
citations

HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention

CVPR 2024
61
citations

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

CVPR 2025
14
citations

An Information Theoretical View for Out-Of-Distribution Detection

ECCV 2024
1
citations

G2PDiffusion: Cross-species Genotype-to-Phenotype Prediction via Evolutionary Diffusion

ICCV 2025
0
citations

Benchmarking Multimodal Large Language Models Against Image Corruptions

ICCV 2025
0
citations

HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding

ICCV 2025
0
citations

Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning

ICCV 2025
0
citations

Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness

CVPR 2024
0
citations

ES³: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations

CVPR 2024
0
citations

Video Harmonization with Triplet Spatio-Temporal Variation Patterns

CVPR 2024
0
citations

Projection Metric Learning on Grassmann Manifold With Application to Video Based Face Recognition

CVPR 2015
0
citations

Shape Driven Kernel Adaptation in Convolutional Neural Network for Robust Facial Traits Recognition

CVPR 2015
0
citations

Discriminant Analysis on Riemannian Manifold of Gaussian Distributions for Face Recognition With Image Sets

CVPR 2015
0
citations

Face Video Retrieval With Image Query via Hashing Across Euclidean Space and Riemannian Manifold

CVPR 2015
0
citations

Deep Supervised Hashing for Fast Image Retrieval

CVPR 2016
0
citations

Occlusion-Free Face Alignment: Deep Regression Networks Coupled With De-Corrupt AutoEncoders

CVPR 2016
0
citations

Multi-View Deep Network for Cross-View Classification

CVPR 2016
0
citations

Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks

CVPR 2017
0
citations

Discriminative Covariance Oriented Representation Learning for Face Recognition With Image Sets

CVPR 2017
0
citations

Duplex Generative Adversarial Network for Unsupervised Domain Adaptation

CVPR 2018
0
citations

Real-Time Rotation-Invariant Face Detection With Progressive Calibration Networks

CVPR 2018arXiv
0
citations

Mean-Variance Loss for Deep Age Estimation From a Face

CVPR 2018
0
citations

Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships

CVPR 2018arXiv
0
citations

VRSTC: Occlusion-Free Video Person Re-Identification

CVPR 2019
0
citations

Exploring Context and Visual Pattern of Relationship for Scene Graph Generation

CVPR 2019
0
citations

Fully Learnable Group Convolution for Acceleration of Deep Neural Networks

CVPR 2019
0
citations

Interaction-And-Aggregation Network for Person Re-Identification

CVPR 2019
0
citations

Self-Supervised Representation Learning From Videos for Facial Action Unit Detection

CVPR 2019
0
citations

Weakly Supervised Image Classification Through Noise Regularization

CVPR 2019
0
citations

Local Relationship Learning With Person-Specific Shape Regularization for Facial Action Unit Detection

CVPR 2019
0
citations

Unsupervised Domain Adaptation With Hierarchical Gradient Synchronization

CVPR 2020
0
citations

Cross-Domain Face Presentation Attack Detection via Multi-Domain Disentangled Representation Learning

CVPR 2020arXiv
0
citations

Single-Side Domain Generalization for Face Anti-Spoofing

CVPR 2020arXiv
0
citations

Self-Supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

CVPR 2020arXiv
0
citations

TCTS: A Task-Consistent Two-Stage Framework for Person Search

CVPR 2020
0
citations

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

CVPR 2020arXiv
0
citations

BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification

CVPR 2021
0
citations

Clothes-Changing Person Re-Identification With RGB Modality Only

CVPR 2022arXiv
0
citations

Enhancing Face Recognition With Self-Supervised 3D Reconstruction

CVPR 2022
0
citations

DISC: Learning From Noisy Labels via Dynamic Instance-Specific Selection and Correction

CVPR 2023
0
citations

Source-Free Adaptive Gaze Estimation by Uncertainty Reduction

CVPR 2023
0
citations

Diversity-Measurable Anomaly Detection

CVPR 2023arXiv
0
citations

A Unified Multiplicative Framework for Attribute Learning

ICCV 2015
0
citations

Leveraging Datasets With Varying Annotations for Face Alignment via Deep Regression Network

ICCV 2015
0
citations

Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction

ICCV 2015
0
citations

Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation

ICCV 2015
0
citations

Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition

ICCV 2017
0
citations

Learning Discriminative Latent Attributes for Zero-Shot Classification

ICCV 2017
0
citations

S2GAN: Share Aging Factors Across Ages and Share Aging Trends Among Individuals

ICCV 2019
0
citations

Temporal Knowledge Propagation for Image-to-Video Person Re-Identification

ICCV 2019
0
citations

Face Forgery Video Detection via Temporal Forgery Cue Unraveling

CVPR 2025
0
citations

Transferable Contrastive Network for Generalized Zero-Shot Learning

ICCV 2019
0
citations

Meta Gradient Adversarial Attack

ICCV 2021arXiv
0
citations

EigenGAN: Layer-Wise Eigen-Learning for GANs

ICCV 2021arXiv
0
citations

Cross-Encoder for Unsupervised Gaze Representation Learning

ICCV 2021
0
citations

DandelionNet: Domain Composition with Instance Adaptive Classification for Domain Generalization

ICCV 2023
0
citations

Holistic Label Correction for Noisy Multi-Label Classification

ICCV 2023
0
citations

Video-based Remote Physiological Measurement via Cross-verified Feature Disentangling

ECCV 2020
0
citations

Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation

ECCV 2020
0
citations

Temporal Complementary Learning for Video Person Re-Identification

ECCV 2020
0
citations

Adaptive Image Transformations for Transfer-Based Adversarial Attack

ECCV 2022
0
citations

GAN with Multivariate Disentangling for Controllable Hair Editing

ECCV 2022
0
citations

Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

ECCV 2022
0
citations

Weakly Supervised Object Detection With Segmentation Collaboration

ICCV 2019
0
citations

Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information

ICCV 2025
0
citations

EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models

ICCV 2025
0
citations

CogCM: Cognition-Inspired Contextual Modeling for Audio-Visual Speech Enhancement

ICCV 2025
0
citations

Cross Attention Network for Few-shot Classification

NeurIPS 2019
0
citations

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition

NeurIPS 2019
0
citations

Optimal Positive Generation via Latent Transformation for Contrastive Learning

NeurIPS 2022
0
citations

Understanding Few-Shot Learning: Measuring Task Relatedness and Adaptation Difficulty via Attributes

NeurIPS 2023
0
citations

Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation

NeurIPS 2023
0
citations

Log-Euclidean Metric Learning on Symmetric Positive Definite Manifold with Application to Image Set Classification

ICML 2015
0
citations