Shu-Tao Xia

24
Papers
174
Total Citations

Papers (24)

BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP

CVPR 2024
68
citations

Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders

AAAI 2024arXiv
61
citations

Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark

ICLR 2024
18
citations

Error-quantified Conformal Inference for Time Series

ICLR 2025arXiv
8
citations

Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations

CVPR 2025
4
citations

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

ICCV 2025
4
citations

Controller-Guided Partial Label Consistency Regularization with Unlabeled Data

AAAI 2024arXiv
3
citations

Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation

CVPR 2025
3
citations

Efficient Self-Supervised Video Hashing with Selective State Spaces

AAAI 2025
3
citations

Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning

CVPR 2025
2
citations

One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models

ICCV 2025
0
citations

Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression

ICCV 2025
0
citations

GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud

AAAI 2025
0
citations

FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning

ICCV 2025
0
citations

Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution

AAAI 2025
0
citations

CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning

AAAI 2025
0
citations

Pre-Trained Vision-Language Models as Noisy Partial Annotators

AAAI 2025
0
citations

Adapting Pre-trained 3D Models for Point Cloud Video Understanding via Cross-frame Spatio-temporal Perception

CVPR 2025
0
citations

Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding

AAAI 2024
0
citations

MambaIRv2: Attentive State Space Restoration

CVPR 2025
0
citations

GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval

AAAI 2024
0
citations

Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers

CVPR 2024
0
citations

PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter

CVPR 2025
0
citations

AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing

CVPR 2025
0
citations