Siyang Sun
4
Papers
38
Total Citations
Papers (4)
Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation
AAAI 2024arXiv
30
citations
Aligned Better, Listen Better for Audio-Visual Large Language Models
ICLR 2025
8
citations
FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors
ECCV 2024
0
citations
CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
CVPR 2024
0
citations