Xu Sun
4
Papers
424
Total Citations
Papers (4)
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
CVPR 2024
356
citations
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
ECCV 2024
49
citations
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
AAAI 2025
19
citations
VidTwin: Video VAE with Decoupled Structure and Dynamics
CVPR 2025
0
citations