Xiaoshuai Sun

15
Papers
182
Total Citations

Papers (15)

Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

CVPR 2024
89
citations

Towards General Visual-Linguistic Face Forgery Detection

CVPR 2025
34
citations

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

AAAI 2024arXiv
19
citations

AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models

ICCV 2025arXiv
13
citations

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

ECCV 2024arXiv
9
citations

StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization

AAAI 2025
6
citations

Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings

NeurIPS 2025arXiv
5
citations

FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression

CVPR 2025
4
citations

IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation

AAAI 2025
3
citations

X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks

AAAI 2024
0
citations

ACL: Activating Capability of Linear Attention for Image Restoration

CVPR 2025
0
citations

X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation

ICML 2024
0
citations

Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models

ICML 2024
0
citations

SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation

ICML 2024
0
citations

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization

ICML 2024
0
citations