Gedas Bertasius
8
Papers
129
Total Citations
Papers (8)
Video ReCap: Recursive Captioning of Hour-Long Videos
CVPR 2024
82
citations
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding
NeurIPS 2025
29
citations
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
CVPR 2025
9
citations
BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation
CVPR 2025
9
citations
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024
0
citations
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering
CVPR 2025
0
citations
LoCoNet: Long-Short Context Network for Active Speaker Detection
CVPR 2024
0
citations
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
CVPR 2025
0
citations