Xiaoshuai Sun
15
Papers
182
Total Citations
Papers (15)
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
CVPR 2024
89
citations
Towards General Visual-Linguistic Face Forgery Detection
CVPR 2025
34
citations
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
AAAI 2024arXiv
19
citations
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models
ICCV 2025arXiv
13
citations
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
ECCV 2024arXiv
9
citations
StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
AAAI 2025
6
citations
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings
NeurIPS 2025arXiv
5
citations
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression
CVPR 2025
4
citations
IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation
AAAI 2025
3
citations
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks
AAAI 2024
0
citations
ACL: Activating Capability of Linear Attention for Image Restoration
CVPR 2025
0
citations
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
ICML 2024
0
citations
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models
ICML 2024
0
citations
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
ICML 2024
0
citations
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
ICML 2024
0
citations