Guohao Sun
5
Papers
93
Total Citations
Papers (5)
Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
CVPR 2024
63
citations
SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
ECCV 2024arXiv
23
citations
Latent Chain-of-Thought for Visual Reasoning
NeurIPS 2025arXiv
7
citations
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue
ICCV 2025
0
citations
Prototypical Transformer As Unified Motion Learners
ICML 2024
0
citations