Stefano Soatto

79
Papers
66
Total Citations

Papers (79)

CPR: Retrieval Augmented Generation for Copyright Protection

CVPR 2024
26
citations

Diffusion Soup: Model Merging for Text-to-Image Diffusion Models

ECCV 2024
21
citations

Enhancing Vision-Language Pre-training with Rich Supervisions

CVPR 2024
15
citations

Interpretable Measures of Conceptual Similarity by Complexity-Constrained Descriptive Auto-Encoding

CVPR 2024
4
citations

Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation

CVPR 2024
0
citations

Multi-Modal Hallucination Control by Visual Information Grounding

CVPR 2024
0
citations

THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models

CVPR 2024
0
citations

Fewer Truncations Improve Language Modeling

ICML 2024
0
citations

Sub-token ViT Embedding via Stochastic Resonance Transformers

ICML 2024
0
citations

Efficient Minimal-Surface Regularization of Perspective Depth Maps in Variational Stereo

CVPR 2015
0
citations

Texture Representations for Image and Video Synthesis

CVPR 2015
0
citations

Multi-View Feature Engineering and Learning

CVPR 2015
0
citations

Causal Video Object Segmentation From Persistence of Occlusions

CVPR 2015
0
citations

Domain-Size Pooling in Local Descriptors: DSP-SIFT

CVPR 2015
0
citations

Scaling up Image Segmentation across Data and Tasks

CVPR 2025
0
citations

Visual-Inertial-Semantic Scene Representation for 3D Object Detection

CVPR 2017
0
citations

S2F: Slow-To-Fast Interpolator Flow

CVPR 2017
0
citations

Zero Shot Learning via Multi-Scale Manifold Regularization

CVPR 2017
0
citations

OATM: Occlusion Aware Template Matching by Consensus Set Maximization

CVPR 2018 (arXiv)
0
citations

Empirical Study of the Topology and Geometry of Deep Networks

CVPR 2018
0
citations

Unsupervised Moving Object Detection via Contextual Information Separation

CVPR 2019
0
citations

Dense Depth Posterior (DDP) From Single Image and Sparse Range

CVPR 2019
0
citations

Bilateral Cyclic Constraint and Adaptive Regularization for Unsupervised Monocular Depth Prediction

CVPR 2019
0
citations

GeoNet: Deep Geodesic Networks for Point Cloud Analysis

CVPR 2019
0
citations

Meta-Learning With Differentiable Convex Optimization

CVPR 2019
0
citations

FDA: Fourier Domain Adaptation for Semantic Segmentation

CVPR 2020 (arXiv)
0
citations

Towards Backward-Compatible Representation Learning

CVPR 2020 (arXiv)
0
citations

Learning to Manipulate Individual Objects in an Image

CVPR 2020 (arXiv)
0
citations

Eternal Sunshine of the Spotless Net: Selective Forgetting in Deep Networks

CVPR 2020 (arXiv)
0
citations

Phase Consistent Ecological Domain Adaptation

CVPR 2020 (arXiv)
0
citations

Exponential Moving Average Normalization for Self-Supervised and Semi-Supervised Learning

CVPR 2021 (arXiv)
0
citations

Mixed-Privacy Forgetting in Deep Networks

CVPR 2021 (arXiv)
0
citations

Compatibility-Aware Heterogeneous Visual Search

CVPR 2021 (arXiv)
0
citations

LQF: Linear Quadratic Fine-Tuning

CVPR 2021 (arXiv)
0
citations

Positive-Congruent Training: Towards Regression-Free Model Updates

CVPR 2021 (arXiv)
0
citations

DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping

CVPR 2021 (arXiv)
0
citations

Learning Semantic-Aware Dynamics for Video Prediction

CVPR 2021 (arXiv)
0
citations

Mixed Differential Privacy in Computer Vision

CVPR 2022 (arXiv)
0
citations

Class-Incremental Learning With Strong Pre-Trained Models

CVPR 2022 (arXiv)
0
citations

Task Adaptive Parameter Sharing for Multi-Task Learning

CVPR 2022 (arXiv)
0
citations

MeMOT: Multi-Object Tracking With Memory

CVPR 2022 (arXiv)
0
citations

Omni-DETR: Omni-Supervised Object Detection With Transformers

CVPR 2022
0
citations

Stereoscopic Universal Perturbations Across Different Architectures and Datasets

CVPR 2022 (arXiv)
0
citations

Train/Test-Time Adaptation With Retrieval

CVPR 2023 (arXiv)
0
citations

Critical Learning Periods for Multisensory Integration in Deep Networks

CVPR 2023 (arXiv)
0
citations

Depth Estimation From Camera Image and mmWave Radar Point Cloud

CVPR 2023
0
citations

A-La-Carte Prompt Tuning (APT): Combining Distinct Data via Composable Prompting

CVPR 2023
0
citations

A Meta-Learning Approach to Predicting Performance and Data Requirements

CVPR 2023 (arXiv)
0
citations

Guided Recommendation for Model Fine-Tuning

CVPR 2023
0
citations

Self-Occlusions and Disocclusions in Causal Video Object Segmentation

ICCV 2015
0
citations

Few-Shot Learning With Embedded Class Models and Shot-Free Meta Training

ICCV 2019
0
citations

Unsupervised Domain Adaptation via Regularized Conditional Alignment

ICCV 2019
0
citations

Task2Vec: Task Embedding for Meta-Learning

ICCV 2019
0
citations

Learning Hierarchical Graph Neural Networks for Image Clustering

ICCV 2021 (arXiv)
0
citations

ARCH++: Animation-Ready Clothed Human Reconstruction Revisited

ICCV 2021
0
citations

Unsupervised Depth Completion With Calibrated Backprojection Layers

ICCV 2021 (arXiv)
0
citations

Visual Relationship Detection Using Part-and-Sum Transformers With Composite Queries

ICCV 2021 (arXiv)
0
citations

SAFE: Machine Unlearning With Shard Graphs

ICCV 2023 (arXiv)
0
citations

Linear Spaces of Meanings: Compositional Structures in Vision-Language Models

ICCV 2023 (arXiv)
0
citations

Tangent Model Composition for Ensembling and Continual Fine-tuning

ICCV 2023 (arXiv)
0
citations

Incremental Few-Shot Meta-Learning via Indirect Discriminant Alignment

ECCV 2020
0
citations

Forgetting Outside the Box: Scrubbing Deep Networks of Information Accessible from Input-Output Observations

ECCV 2020
0
citations

Not Just Streaks: Towards Ground Truth for Single Image Deraining

ECCV 2022
0
citations

X-DETR: A Versatile Architecture for Instance-Wise Vision-Language Tasks

ECCV 2022
0
citations

An Empirical Evaluation of Current Convolutional Architectures' Ability to Manage Nuisance Location and Scale Variability

CVPR 2016
0
citations

WorDepth: Variational Language Prior for Monocular Depth Estimation

CVPR 2024
0
citations

Non-autoregressive Sequence-to-Sequence Vision-Language Models

CVPR 2024
0
citations

On the Scalability of Diffusion-based Text-to-Image Generation

CVPR 2024
0
citations

Time Matters in Regularizing Deep Networks: Weight Decay and Data Augmentation Affect Early Learning Dynamics, Matter Little Near Convergence

NeurIPS 2019
0
citations

Predicting Training Time Without Training

NeurIPS 2020
0
citations

Targeted Adversarial Perturbations for Monocular Depth Prediction

NeurIPS 2020
0
citations

Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction

NeurIPS 2020
0
citations

Long Short-Term Transformer for Online Action Detection

NeurIPS 2021
0
citations

Uniform Sampling over Episode Difficulty

NeurIPS 2021
0
citations

On Leave-One-Out Conditional Mutual Information For Generalization

NeurIPS 2022
0
citations

Semi-supervised Vision Transformers at Scale

NeurIPS 2022
0
citations

Leveraging sparse and shared feature activations for disentangled representation learning

NeurIPS 2023
0
citations

Your representations are in the network: composable and parallel adaptation for large scale models

NeurIPS 2023
0
citations

Gacs-Korner Common Information Variational Autoencoder

NeurIPS 2023
0
citations