Silvio Savarese

50
Papers
637
Total Citations

Papers (50)

Learning Transferrable Representations for Unsupervised Domain Adaptation

NeurIPS 2016
281
citations

ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding

CVPR 2024
192
citations

HIVE: Harnessing Human Feedback for Instructional Visual Editing

CVPR 2024
164
citations

Data-Driven 3D Voxel Patterns for Object Category Recognition

CVPR 2015
0
citations

Enriching Object Detection With 2D-3D Registration and Continuous Viewpoint Estimation

CVPR 2015
0
citations

Watch-n-Patch: Unsupervised Understanding of Actions and Relations

CVPR 2015
0
citations

DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes

CVPR 2016
0
citations

Social LSTM: Human Trajectory Prediction in Crowded Spaces

CVPR 2016
0
citations

3D Semantic Parsing of Large-Scale Indoor Spaces

CVPR 2016
0
citations

Deep Metric Learning via Lifted Structured Feature Embedding

CVPR 2016
0
citations

Structural-RNN: Deep Learning on Spatio-Temporal Graphs

CVPR 2016
0
citations

Feedback Networks

CVPR 2017arXiv
0
citations

Deep View Morphing

CVPR 2017arXiv
0
citations

Social Scene Understanding: End-To-End Multi-Person Action Localization and Collective Activity Recognition

CVPR 2017arXiv
0
citations

Demo2Vec: Reasoning Object Affordances From Online Videos

CVPR 2018
0
citations

Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks

CVPR 2018arXiv
0
citations

Taskonomy: Disentangling Task Transfer Learning

CVPR 2018arXiv
0
citations

Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View

CVPR 2018
0
citations

Adversarial Feature Augmentation for Unsupervised Domain Adaptation

CVPR 2018arXiv
0
citations

Deep Learning Under Privileged Information Using Heteroscedastic Dropout

CVPR 2018arXiv
0
citations

Gibson Env: Real-World Perception for Embodied Agents

CVPR 2018arXiv
0
citations

TopNet: Structural Point Cloud Decoder

CVPR 2019
0
citations

Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks

CVPR 2019
0
citations

Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression

CVPR 2019
0
citations

SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints

CVPR 2019
0
citations

4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks

CVPR 2019
0
citations

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion

CVPR 2019
0
citations

Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration

CVPR 2019
0
citations

Topological Planning With Transformers for Vision-and-Language Navigation

CVPR 2021arXiv
0
citations

JRDB-Act: A Large-Scale Dataset for Spatio-Temporal Action, Social Group and Activity Detection

CVPR 2022
0
citations

ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding

CVPR 2023
0
citations

Procedure-Aware Pretraining for Instructional Video Understanding

CVPR 2023
0
citations

Unsupervised Semantic Parsing of Video Collections

ICCV 2015
0
citations

Action Recognition by Hierarchical Mid-Level Action Elements

ICCV 2015
0
citations

Learning to Track: Online Multi-Object Tracking by Decision Making

ICCV 2015
0
citations

Text2Data: Low-Resource Data Generation with Textual Control

AAAI 2025
0
citations

Lattice Long Short-Term Memory for Human Action Recognition

ICCV 2017arXiv
0
citations

Situational Fusion of Visual Representation for Visual Navigation

ICCV 2019
0
citations

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

ICCV 2019
0
citations

TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild

ICCV 2021arXiv
0
citations

Generative Sparse Detection Networks for 3D Single-shot Object Detection

ECCV 2020
0
citations

Universal Correspondence Network

NeurIPS 2016arXiv
0
citations

Tracking the Untrackable: Learning to Track Multiple Cues With Long-Term Dependencies

ICCV 2017arXiv
0
citations

Unified Training of Universal Time Series Forecasting Transformers

ICML 2024
0
citations

A Coarse-to-Fine Model for 3D Pose Estimation and Sub-Category Recognition

CVPR 2015
0
citations

Generalizing to Unseen Domains via Adversarial Data Augmentation

NeurIPS 2018
0
citations

Regression Planning Networks

NeurIPS 2019
0
citations

Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks

NeurIPS 2019
0
citations

CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning

NeurIPS 2022
0
citations

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

NeurIPS 2023
0
citations