Longteng Guo
6
Papers
37
Total Citations
Papers (6)
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
AAAI 2024arXiv
20
citations
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
ICLR 2025
14
citations
Breaking the Encoder Barrier for Seamless Video-Language Understanding
ICCV 2025
3
citations
Efficient Motion-Aware Video MLLM
CVPR 2025
0
citations
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
CVPR 2024
0
citations
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
CVPR 2024
0
citations