Hilde Kuehne
8
Papers
134
Total Citations
2
Affiliations
Affiliations
Goethe University FrankfurtMIT-IBM Watson AI Lab
Papers (8)
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers
CVPR 2024arXiv
74
citations
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale
ECCV 2024arXiv
31
citations
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity
ICCV 2025arXiv
24
citations
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
CVPR 2025arXiv
2
citations
Teaching VLMs to Localize Specific Objects from In-context Examples
ICCV 2025arXiv
2
citations
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks
CVPR 2025arXiv
1
citations
VideoGEM: Training-free Action Grounding in Videos
CVPR 2025arXiv
0
citations
What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
CVPR 2024arXiv
0
citations