Lorenzo Baraldi
10 papers, 38 total citations
papers (10)
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation. ICCV 2025, arXiv. 22 citations.
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering. CVPR 2025. 10 citations.
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval. CVPR 2025, arXiv. 4 citations.
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models. ICCV 2025, arXiv. 2 citations.
Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025. 0 citations.
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models. ICCV 2025. 0 citations.
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation. CVPR 2024, arXiv. 0 citations.
Meshed-Memory Transformer for Image Captioning. CVPR 2020, arXiv. 0 citations.
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023, arXiv. 0 citations.
With a Little Help from Your Own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023, arXiv. 0 citations.