Yunhao Gou
3 Papers
44 Total Citations
Papers (3)
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
44 citations
Leveraging per Image-Token Consistency for Vision-Language Pre-Training
CVPR 2023 (arXiv)
0 citations
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification
ECCV 2022
0 citations