Xi Li

15
Papers
92
Total Citations

Papers (15)

Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer

AAAI 2024arXiv
38
citations

Exploring Unbiased Deepfake Detection via Token-Level Shuffling and Mixing

AAAI 2025
24
citations

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

CVPR 2024
15
citations

Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning

AAAI 2025
11
citations

Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training

ICCV 2025
2
citations

Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models

AAAI 2025
1
citations

Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model

CVPR 2025
1
citations

BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection

CVPR 2024
0
citations

PiD: Generalized AI-Generated Images Detection with Pixelwise Decomposition Residuals

ICML 2025
0
citations

EDM: Efficient Deep Feature Matching

ICCV 2025
0
citations

RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control

ICCV 2025
0
citations

Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection

ICCV 2025
0
citations

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

AAAI 2025
0
citations

Temporal-Distributed Backdoor Attack against Video Based Action Recognition

AAAI 2024
0
citations

Virtual Immunohistochemistry Staining for Histological Images Assisted by Weakly-supervised Learning

CVPR 2024
0
citations