ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Lorenzo Baraldi

Lorenzo Baraldi

Topic trends: 31,945 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,180 papers | Abstracts: 30,895 (90.4%) | Citations: 34,180 (100.0%) | arXiv: 25,727 (75.3%)

Built: Feb 6, 2026, 1:36 PM AMS

10

papers

38

total citations

papers (10)

Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval

What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models

Hyperbolic Safety-Aware Vision-Language Models

MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models

Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation

Meshed-Memory Transformer for Image Captioning

Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation

With a Little Help from Your Own Past: Prototypical Memory Networks for Image Captioning