Reuben Tan
8
Papers
62
Total Citations
Papers (8)
Koala: Key Frame-Conditioned Long Video-LLM
CVPR 2024
62
citations
SITE: towards Spatial Intelligence Thorough Evaluation
ICCV 2025
0
citations
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
CVPR 2023arXiv
0
citations
Language Features Matter: Effective Language Representations for Vision-Language Tasks
ICCV 2019
0
citations
Learning Similarity Conditions Without Explicit Supervision
ICCV 2019
0
citations
NewsStories: Illustrating Articles with Visual Summaries
ECCV 2022
0
citations
Magma: A Foundation Model for Multimodal AI Agents
CVPR 2025
0
citations
Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
NeurIPS 2021
0
citations