Mubarak Shah
21
Papers
188
Total Citations
Papers (21)
Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors
CVPR 2024
55
citations
M-LLM Based Video Frame Selection for Efficient Video Understanding
CVPR 2025
46
citations
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
CVPR 2025 (arXiv)
21
citations
Composed Video Retrieval via Enriched Context and Discriminative Embeddings
CVPR 2024
20
citations
X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs
ECCV 2024
9
citations
VidLA: Video-Language Alignment at Scale
CVPR 2024
8
citations
Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models
CVPR 2025
6
citations
Open Vocabulary Multi-Label Video Classification
ECCV 2024 (arXiv)
5
citations
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
ECCV 2024 (arXiv)
5
citations
GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers
ECCV 2024 (arXiv)
5
citations
ALBAR: Adversarial Learning approach to mitigate Biases in Action Recognition
ICLR 2025
3
citations
GT-Loc: Unifying When and Where in Images through a Joint Embedding Space
ICCV 2025
2
citations
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos
NeurIPS 2025 (arXiv)
1
citations
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
ICCV 2025
1
citations
Möbius Transform for Mitigating Perspective Distortions in Representation Learning
ECCV 2024 (arXiv)
1
citations
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
CVPR 2025
0
citations
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision
AAAI 2024 (arXiv)
0
citations
Test-Time Retrieval-Augmented Adaptation for Vision-Language Models
ICCV 2025
0
citations
Multiview Aerial Visual RECognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?
CVPR 2024
0
citations
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning
ICCV 2025
0
citations
CoLLM: A Large Language Model for Composed Image Retrieval
CVPR 2025
0
citations