Federico Tombari

85 Papers
229 Total Citations

Papers (85)

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

CVPR 2024
53
citations

LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models

CVPR 2025
44
citations

Learning to Prompt with Text Only Supervision for Vision-Language Models

AAAI 2025
40
citations

CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation

ICLR 2025
20
citations

Active Data Curation Effectively Distills Large-Scale Multimodal Models

CVPR 2025
14
citations

Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos

CVPR 2025
14
citations

LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

CVPR 2025
11
citations

Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation

CVPR 2025
6
citations

One2Any: One-Reference 6D Pose Estimation for Any Object

CVPR 2025
5
citations

Video Perception Models for 3D Scene Synthesis

NeurIPS 2025
5
citations

Test-Time Visual In-Context Tuning

CVPR 2025
4
citations

Gatekeeper: Improving Model Cascades Through Confidence Tuning

NeurIPS 2025
4
citations

4D Gaussian Splatting SLAM

ICCV 2025
3
citations

KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation

CVPR 2024
3
citations

Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation

ICCV 2025
2
citations

UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint

ICCV 2025
1
citation

Query-Guided End-To-End Person Search

CVPR 2019
0
citations

3D Point Capsule Networks

CVPR 2019
0
citations

GFrames: Gradient-Based Local Reference Frame for 3D Shape Matching

CVPR 2019
0
citations

Learning 3D Semantic Scene Graphs From 3D Indoor Reconstructions

CVPR 2020
0
citations

Semantic Image Manipulation Using Scene Graphs

CVPR 2020
0
citations

Learning Graph Embeddings for Compositional Zero-Shot Learning

CVPR 2021
0
citations

Variational Transformer Networks for Layout Generation

CVPR 2021
0
citations

SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences

CVPR 2021
0
citations

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation

CVPR 2021
0
citations

ZebraPose: Coarse To Fine Surface Encoding for 6DoF Object Pose Estimation

CVPR 2022
0
citations

3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection

CVPR 2022
0
citations

Bending Graphs: Hierarchical Shape Matching Using Gated Optimal Transport

CVPR 2022
0
citations

Learning Local Displacements for Point Cloud Completion

CVPR 2022
0
citations

GPV-Pose: Category-Level Object Pose Estimation via Geometry-Guided Point-Wise Voting

CVPR 2022
0
citations

SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation

CVPR 2022
0
citations

Shape, Pose, and Appearance From a Single Image via Bootstrapped Radiance Field Inversion

CVPR 2023
0
citations

Incremental 3D Semantic Scene Graph Prediction From RGB Sequences

CVPR 2023
0
citations

IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction

CVPR 2023
0
citations

I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification

CVPR 2023
0
citations

SPARF: Neural Radiance Fields From Sparse and Noisy Poses

CVPR 2023
0
citations

A Versatile Learning-Based 3D Temporal Tracker: Scalable, Robust, Online

ICCV 2015
0
citations

Learning a Descriptor-Specific 3D Keypoint Detector

ICCV 2015
0
citations

SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again

ICCV 2017
0
citations

Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses

ICCV 2017
0
citations

Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization

ICCV 2017
0
citations

Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation

ICCV 2019
0
citations

Object-Driven Multi-Layer Scene Decomposition From a Single Image

ICCV 2019
0
citations

Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data

ICCV 2019
0
citations

RIO: 3D Object Instance Re-Localization in Changing Indoor Environments

ICCV 2019
0
citations

ForkNet: Multi-Branch Volumetric Semantic Completion From a Single Depth Image

ICCV 2019
0
citations

SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

ICCV 2021
0
citations

Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs

ICCV 2021
0
citations

UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image

CVPR 2025
0
citations

Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction

ICCV 2023
0
citations

SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection

ICCV 2023
0
citations

U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds

ICCV 2023
0
citations

Introducing Language Guidance in Prompt-based Continual Learning

ICCV 2023
0
citations

Segmenting Known Objects and Unseen Unknowns without Prior Knowledge

ICCV 2023
0
citations

Robust Monocular Depth Estimation under Challenging Conditions

ICCV 2023
0
citations

Quaternion Equivariant Capsule Networks for 3D Point Clouds

ECCV 2020
0
citations

Self6D: Self-Supervised Monocular 6D Object Pose Estimation

ECCV 2020
0
citations

SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification

ECCV 2020
0
citations

Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes

ECCV 2020
0
citations

Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis

ECCV 2020
0
citations

RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation

ECCV 2022
0
citations

E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs

ECCV 2022
0
citations

Implicit Neural Representations for Image Compression

ECCV 2022
0
citations

3D Compositional Zero-Shot Learning with DeCompositional Consensus

ECCV 2022
0
citations

GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning

ECCV 2022
0
citations

Unconditional Scene Graph Generation

ICCV 2021
0
citations

RelationField: Relate Anything in Radiance Fields

CVPR 2025
0
citations

ESCAPE: Equivariant Shape Completion via Anchor Point Encoding

CVPR 2025
0
citations

MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning

ICCV 2025
0
citations

Contrastive Test-Time Composition of Multiple LoRA Models for Image Generation

ICCV 2025
0
citations

Hierarchical 3D Scene Graphs Construction Outdoors

ICCV 2025
0
citations

Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations

NeurIPS 2025
0
citations

SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes

CVPR 2024
0
citations

CONFORM: Contrast is All You Need for High-Fidelity Text-to-Image Diffusion Models

CVPR 2024
0
citations

Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

CVPR 2024
0
citations

MOHO: Learning Single-view Hand-held Object Reconstruction with Multi-view Occlusion-Aware Supervision

CVPR 2024
0
citations

HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation

CVPR 2024
0
citations

Extracting Training Data From Document-Based VQA Models

ICML 2024
0
citations

Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core

CVPR 2017
0
citations

CNN-SLAM: Real-Time Dense Monocular SLAM With Learned Depth Prediction

CVPR 2017
0
citations

Guide Me: Interacting With Deep Networks

CVPR 2018
0
citations

I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification

NeurIPS 2022
0
citations

CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion

NeurIPS 2023
0
citations

DDF-HO: Hand-Held Object Reconstruction via Conditional Directed Distance Field

NeurIPS 2023
0
citations

OpenMask3D: Open-Vocabulary 3D Instance Segmentation

NeurIPS 2023
0
citations