Vicente Ordonez

Papers: 20
Total Citations: 64

Papers (20)

ElasticDiffusion: Training-free Arbitrary Size Image Generation through Global-Local Content Separation

CVPR 2024
37
citations

Grounding Language Models for Visual Entity Recognition

ECCV 2024
13
citations

ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders

ECCV 2024
11
citations

LOCORE: Image Re-ranking with Long-Context Sequence Modeling

CVPR 2025
2
citations

Improving Large Vision and Language Models by Learning from a Panel of Peers

ICCV 2025
1
citations

Text2Scene: Generating Compositional Scenes From Textual Descriptions

CVPR 2019
0
citations

General Multi-Label Image Classification With Transformers

CVPR 2021
0
citations

Black-Box Explanation of Object Detectors via Saliency Maps

CVPR 2021
0
citations

SimVQA: Exploring Simulated Environments for Visual Question Answering

CVPR 2022
0
citations

Improving Visual Grounding by Encouraging Consistent Gradient-Based Explanations

CVPR 2023
0
citations

Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations

ICCV 2019
0
citations

Instance-Level Image Retrieval Using Reranking Transformers

ICCV 2021
0
citations

MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning

ICCV 2021
0
citations

Going Beyond Nouns With Vision & Language Models Using Synthetic Data

ICCV 2023
0
citations

Generative-Discriminative Feature Representations for Open-Set Recognition

CVPR 2020
0
citations

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

ICCV 2025
0
citations

Improved Visual Grounding through Self-Consistent Explanations

CVPR 2024
0
citations

Commonly Uncommon: Semantic Sparsity in Situation Recognition

CVPR 2017
0
citations

Feedback-Prop: Convolutional Neural Network Inference Under Partial Evidence

CVPR 2018
0
citations

Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries

NeurIPS 2019
0
citations