Shu-Tao Xia
24
Papers
174
Total Citations
Papers (24)
BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP
CVPR 2024
68
citations
Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders
AAAI 2024, arXiv
61
citations
Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark
ICLR 2024
18
citations
Error-quantified Conformal Inference for Time Series
ICLR 2025, arXiv
8
citations
Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations
CVPR 2025
4
citations
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning
ICCV 2025
4
citations
Controller-Guided Partial Label Consistency Regularization with Unlabeled Data
AAAI 2024, arXiv
3
citations
Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation
CVPR 2025
3
citations
Efficient Self-Supervised Video Hashing with Selective State Spaces
AAAI 2025
3
citations
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
CVPR 2025
2
citations
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models
ICCV 2025
0
citations
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression
ICCV 2025
0
citations
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud
AAAI 2025
0
citations
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning
ICCV 2025
0
citations
Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution
AAAI 2025
0
citations
CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning
AAAI 2025
0
citations
Pre-Trained Vision-Language Models as Noisy Partial Annotators
AAAI 2025
0
citations
Adapting Pre-trained 3D Models for Point Cloud Video Understanding via Cross-frame Spatio-temporal Perception
CVPR 2025
0
citations
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding
AAAI 2024
0
citations
MambaIRv2: Attentive State Space Restoration
CVPR 2025
0
citations
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
AAAI 2024
0
citations
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
CVPR 2024
0
citations
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter
CVPR 2025
0
citations
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
CVPR 2025
0
citations