Md Mohaiminul Islam
Papers: 4
Total Citations: 91
Papers (4)
Video ReCap: Recursive Captioning of Hour-Long Videos
CVPR 2024, 82 citations
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
CVPR 2025, 9 citations
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering
CVPR 2025, 0 citations
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024, 0 citations