Shiguang Shan

16
Papers
212
Total Citations

Papers (16)

Autoregressive Video Generation without Vector Quantization

ICLR 2025arXiv
101
citations

HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention

CVPR 2024
61
citations

Tokenize Anything via Prompting

ECCV 2024arXiv
35
citations

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

CVPR 2025
14
citations

An Information Theoretical View for Out-Of-Distribution Detection

ECCV 2024
1
citations

Benchmarking Multimodal Large Language Models Against Image Corruptions

ICCV 2025
0
citations

HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding

ICCV 2025
0
citations

Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning

ICCV 2025
0
citations

Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness

CVPR 2024
0
citations

ES³: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations

CVPR 2024
0
citations

Face Forgery Video Detection via Temporal Forgery Cue Unraveling

CVPR 2025
0
citations

Video Harmonization with Triplet Spatio-Temporal Variation Patterns

CVPR 2024
0
citations

Not Only Vision: Evolve Visual Speech Recognition via Peripheral Information

ICCV 2025
0
citations

EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models

ICCV 2025
0
citations

CogCM: Cognition-Inspired Contextual Modeling for Audio-Visual Speech Enhancement

ICCV 2025
0
citations

G2PDiffusion: Cross-species Genotype-to-Phenotype Prediction via Evolutionary Diffusion

ICCV 2025
0
citations