Shoubin Yu
Papers: 5
Total Citations: 24
Papers (5)
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
ICLR 2025, 15 citations
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
ICLR 2025, 9 citations
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
CVPR 2025, 0 citations
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
CVPR 2025, 0 citations
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
ICCV 2025, 0 citations