Xiufeng Song
4
Papers
43
Total Citations
Papers (4)
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
CVPR 2025arXiv
24
citations
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
ICCV 2025
11
citations
VIKI‑R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning
NeurIPS 2025arXiv
8
citations
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
CVPR 2025
0
citations