Yuchong Sun
3
Papers
9
Total Citations
Papers (3)
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
NeurIPS 2025arXiv
4
citations
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering
ICCV 2025
3
citations
EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics
AAAI 2025
2
citations