Gedas Bertasius
9
Papers
138
Total Citations
Papers (9)
Video ReCap: Recursive Captioning of Hour-Long Videos
CVPR 2024
82
citations
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding
NeurIPS 2025arXiv
28
citations
Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos
ECCV 2024arXiv
10
citations
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
CVPR 2025
9
citations
BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation
CVPR 2025
9
citations
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024
0
citations
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering
CVPR 2025
0
citations
LoCoNet: Long-Short Context Network for Active Speaker Detection
CVPR 2024
0
citations
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
CVPR 2025
0
citations